Skip to main content
Field notes / architecture

Why proximity matters: tools, meaning, and memory belong together

Most AI agent stacks are gateways wrapped in auth. The hard work is not routing tool calls; it is making sure context arrives with them.

12-minute readArchitecture

The integration thesis

An AI agent operating against enterprise data needs three things to be useful. It needs tools, so it can act. It needs meaning, so it knows what its actions return. It needs memory, so the next agent in its place inherits what the last one figured out. The naive stack treats these as three separate problems solved by three separate products: a gateway for tools, a catalog for meaning, a memory store for state. The integrated stack treats them as one problem solved at one layer.

Plexara is built around the second view. The MCP envelope that carries a tool call to a backend system also carries the catalog context that explains the response and the knowledge captured from the last hundred sessions that asked something similar. The agent does not have to ask for context; context arrives with the call. The agent does not have to consult a memory store; memory has already shaped the response. Tools, meaning, and memory share the same envelope.

This article argues why that proximity matters. Not as a vendor claim, but as a sequence of concrete consequences for hallucination rates, latency budgets, audit posture, and adoption gradients. The thesis is simple. Co-locating tools, meaning, and memory inside one MCP envelope produces an agent that is qualitatively different from a tool-caller wrapped in auth.

Tools without context hallucinate

Most AI agent platforms today are gateway products. They authenticate the agent, route tool calls to backend systems, and return whatever the backend returned. The agent gets the rows but not the meaning of the columns. It gets the response but not the provenance. It does not know that the customer_id field is a foreign key to a deprecated schema, or that the price column is denominated in cents and not dollars, or that the row count it just retrieved excludes a tenant that was migrated last quarter.

In this configuration, the agent has to guess. The model is good at guessing, but enterprise data is full of hidden landmines that no amount of training can prepare it for: tribal naming conventions, soft-deleted records, business rules encoded in trigger logic, columns whose semantics changed three years ago when an integration was rewritten. An agent without semantic grounding produces queries that look right but compute the wrong number.

The fix is not a smarter model. The fix is making sure the response includes what the response means. A tool call that returns a table should also return the catalog metadata for that table: ownership, sensitivity classification, deprecation status, glossary terms applied to each column, recent changes from the last data contract revision. The agent reasons about an enriched response, not a bare one. Hallucination drops because the floor of available context is higher.

Context without action is academic

The mirror failure mode is a catalog the agent can read but not act on. Some enterprises have invested heavily in metadata catalogs, glossaries, and data contracts. The catalog is rich. The agent can browse it. What the agent cannot do is execute a query against the data described by the catalog. The catalog and the execution engine live in different products, with different auth, different APIs, and different latency budgets. The agent ends up doing five hops to answer one question, three of them metadata and two of them data.

This is the gateway problem in reverse. A pure gateway gives the agent action without context. A pure catalog gives the agent context without action. Neither is autonomous. Both require the agent to do orchestration work that should be a property of the platform. Plexara puts both in the same envelope: the catalog context that describes a dataset arrives with the data when the agent queries it. There is no separate catalog round trip. The action and the context fuse at the protocol layer.

The latency tax of separate products

Latency is the easiest number to put on the proximity argument. Suppose an agent answers a single business question by calling one query tool. In a separate-product stack the agent has to coordinate at least three round trips: one to the gateway to call the query tool, one to the catalog to fetch metadata for the result, one to the memory store to retrieve any prior knowledge about how this question has been answered before. Each round trip is a network hop, a token-budget burn, and an opportunity for the orchestration to fail.

In the integrated stack the same question is one round trip. The agent calls the tool. The response includes the catalog context and the applied knowledge. The agent reasons over the unified payload. Three round trips become one, three sets of credentials become one, three potential failure modes become one. At the scale of autonomous agents running thousands of questions per day, the integration is not a quality-of-life win. It is the difference between a feasible system and an infeasible one.

The latency math is also why bolting a gateway in front of a separate catalog and memory product never quite works. The gateway can authenticate and route, but it cannot atomically merge the responses from three independent backends without becoming an integration platform itself. The hard part of the integration is not auth. The hard part is fusing the responses at the protocol layer with consistent semantics. That is application logic, and it does not live at the gateway tier.

Knowledge closes the loop

The third capability in the envelope is memory, but memory in the Plexara sense is more than a vector store of past conversations. It is the closed loop where corrections from one session improve responses in the next. An analyst tells an agent that a column it just queried is deprecated; that observation flows back through the governance workflow and lands in the catalog as an annotation. The next agent that queries the same column in any session, for any persona, sees the deprecation warning attached to the response automatically.

In a separate-product stack this loop is broken. The memory store records that a given conversation flagged a deprecation, but the catalog never learns. The next agent has to re-discover the same fact. Tribal knowledge accumulates in conversation logs but never becomes institutional knowledge that improves the catalog. The platform does not improve with use; it merely accumulates a transcript.

Plexara treats every tool call as potential training signal for the catalog. Knowledge captured during one session, once reviewed through governance, enriches the catalog, which enriches future tool calls. The platform improves with use because the loop is closed at the protocol layer, not at the application layer. This is what proximity buys: the feedback path is shorter than the gap between products.

Trust boundaries belong at the tool-call layer

Persona-based access control, PII flags, deprecation notices, ownership boundaries: these are governance concerns that exist as catalog metadata. In a separate-product stack, applying them to a tool call requires the agent to do the right post-hoc lookups. An analyst persona queries a sensitive table; the agent must remember to consult the catalog to discover that the table is sensitive, then redact accordingly. The governance is correct only if the agent is disciplined.

Inside the Plexara envelope, governance applies at the tool-call layer automatically. The same response that carries the rows carries the persona-aware redaction, the PII flags applied to specific columns, the deprecation warning if the table should not be used for new analysis. The agent does not have to discover these constraints; they are properties of the response. Trust boundaries align with the unit of work.

For regulated industries this is not a convenience. It is the difference between a defensible audit trail and a brittle one. An auditor asking whether the agent saw the PII flag at the moment of the query gets a single timestamped record showing exactly what the agent received. In a separate-product stack the same answer requires reconstructing across the gateway log, the catalog log, and the agent transcript, hoping the timestamps line up. Governance proximity is governance honesty.

Adoption gradient: existing investments get smarter

A reasonable objection to the integrated thesis is that it sounds like a rip-and-replace platform. It is not. The two gateway capabilities, MCP Gateway and API Gateway, exist precisely to bring existing investments into the envelope without rewrite. The MCP Gateway pulls in any MCP-compatible server an organization already runs. The API Gateway pulls in any REST or GraphQL endpoint, with the same enrichment pipeline applied. Existing infrastructure does not get replaced; it gets enriched.

This matters for the buying motion. A team can adopt Plexara as the connective tissue across what they have rather than as a replacement for it. The first agent the team deploys uses the API Gateway to reach an internal service that already exists. The response comes back enriched with catalog context and applied knowledge. The team did not rewrite the service. They added a layer that makes every call to it smarter.

The adoption gradient also reverses a subtle assumption about agent platforms. Many platforms are built on the idea that agents need new tools written specifically for them. Plexara is built on the idea that agents should make better use of the tools that already exist. The MCP and API gateways are the path. The enrichment pipeline is the value.

A four-capability map for senior data leaders

A senior data leader evaluating an AI platform is usually asking four questions in some order. Will my agents hallucinate? Will I lose the tribal knowledge that lives in my analysts' heads? Can I integrate this with the data infrastructure I already paid for? Can I prove what the agent saw when something goes wrong? Each of these questions maps directly to one of the four capabilities in the Plexara envelope.

Hallucination maps to semantic enrichment. Tribal knowledge maps to knowledge capture. Integration maps to the two gateways. Audit posture maps to the persona-aware governance proximity that comes with all of the above. The questions are not independent and the answers are not independent; the whole point of the integrated stack is that solving one problem requires the others to be in proximity.

A vendor selling only a gateway, only a catalog, or only a memory store can answer one question well and the others poorly. A platform that puts all three in the same envelope answers them together because the answers depend on each other. That dependency is the moat. It is also why the integrated story is hard to copy by adding features to a single-capability product.

When proximity matters most

Proximity is a free lunch when you are answering one question with one tool. It is everything when you are running autonomous agents at scale, in regulated industries, across multi-team data estates that none of the agents have full mental models of. The harder the question, the more the agent has to lean on the platform rather than on the model. The more the agent leans on the platform, the more the gaps between separate products become the dominant failure mode.

The bet behind Plexara is that the next phase of enterprise AI is not better models. It is better platforms that make existing models useful against existing data. The gateway-only architectures will get the easy questions right and the hard questions wrong. The integrated architectures will get the hard questions right because the hard questions require tools, meaning, and memory all at once.

Tools without context hallucinate. Context without action is academic. Memory without a loop is a transcript. In the same envelope, all three become the substrate for an agent that can do real work without needing the human to orchestrate. That is what proximity buys.