Skip to main content
Field notes / governance

Why point-solution catalogs and semantic layers are not enough

Data catalogs document data but cannot execute queries. Semantic layers define metrics but delegate execution. Neither provides unified context and access.

10-minute readGovernance

The catalog-execution gap

Data catalog vendors and semantic layer products are both adding MCP servers. This is a meaningful step: AI agents can now search the catalog, browse metadata, and in some cases write documentation back through MCP tool calls. The most capable catalog MCP servers support both read and write operations.

The fundamental limitation persists: catalogs document data but cannot execute queries. When an agent discovers a relevant dataset through a catalog MCP server, it needs a separate MCP server to actually query the data. The context about what the data means lives in one system. The ability to access the data lives in another.

This gap creates multi-hop agent workflows. The agent calls the catalog to find a dataset. Then calls the query engine to describe the schema. Then calls the catalog again for business context. Then calls the query engine to execute a query. Each hop is a separate tool call with its own latency, failure mode, and token cost.

Catalogs document data but cannot execute queries. Semantic layers define metrics but delegate execution. Neither provides unified context and access in a single endpoint.

The structural limitation

Multi-hop workflows and their failure modes

Multi-hop workflows are fragile. If the catalog MCP server returns a dataset name that does not exactly match the query engine table name, the agent must reconcile the mismatch. If the catalog uses one naming convention and the query engine uses another, the agent must translate. Each translation step is an opportunity for error.

Timeout and rate-limiting failures are amplified in multi-hop workflows. If the catalog call succeeds but the query engine call is rate-limited, the agent has partial context. If the query engine call succeeds but a subsequent catalog call for business context times out, the agent returns results without context. The user sees numbers without meaning.

Session coherence is another failure mode. If the catalog metadata changes between the discovery call and the query call, the agent may query a dataset that has been deprecated, renamed, or moved. Multi-hop workflows have no transactional consistency across the hops.

Multi-hop workflow

Agent trace · catalog MCP + query engine MCP

  1. 01
    catalog.searchcatalog

    Find a relevant dataset

    Returns a name that does not match the query engine

  2. 02
    query.describequery

    Inspect schema

    Naming convention mismatch; agent must translate

  3. 03
    catalog.entitycatalog

    Pull business context

    Times out; agent proceeds with partial context

  4. 04
    query.executequery

    Run the query

    Catalog says deprecated; query engine has no idea

    end
4 hops. 4 tool definitions. 4 round trips. No transactional consistency across any of it.
Every hop adds latency, a failure mode, and a token cost. The agent is responsible for reconciling information across systems that have no awareness of each other.

Semantic layers delegate execution

Semantic layer vendors face the same structural limitation from the opposite direction. They define metrics, dimensions, and business rules. They know that "revenue" means a specific calculation applied to specific columns with specific filters. What they cannot do is execute the query.

When an agent asks a semantic layer MCP server for revenue, the semantic layer translates the request into SQL and passes it to the underlying warehouse for execution. The semantic layer is a translation layer, not an execution layer. It depends on the warehouse connection being available, correctly configured, and authorized for the requesting user.

This delegation means the semantic layer cannot enforce governance at query time. It can define that certain metrics require certain permissions, but enforcement depends on the warehouse honoring those permissions. If the warehouse connection uses a service account with broad access, the semantic layer permissions are effectively bypassed.

Semantic layers delegate

Agent

“revenue last quarter”

Semantic layer

defines what “revenue” means

cannot execute

Warehouse

executes the translated SQL

enforcement lives here

If the warehouse connection uses a service account with broad access, every semantic layer permission is effectively bypassed.

The semantic layer is a translation layer, not an execution layer. It depends on the warehouse honoring the permissions it defines.

Governance enforcement gaps

When context lives in the catalog and execution happens in the query engine, governance policies defined in one system may not be enforced in the other. The catalog says a dataset is PII-classified. The query engine does not check the catalog before executing a query against that dataset.

This gap is not theoretical. It is the normal operating condition in most enterprises. Catalog governance policies are aspirational: they describe what should happen. Query engine access controls are operational: they determine what actually happens. The two are maintained by different teams using different tools with different update cycles.

AI agents amplify this gap because they interact with both systems programmatically at high speed. A human analyst might check the catalog before querying sensitive data. An AI agent will take the most efficient path to results, which may bypass the catalog entirely if the query engine is directly accessible.

Context + execution, unified

Agent

one tool call

Plexara

catalog · query · governance · lineage

in one response

  • One tool call, complete business context
  • No reconciliation between catalog and query engine names
  • No partial context from timed-out secondary calls
  • Governance enforced at execution, not aspirational
One call returns results with full business context. No reconciliation. No partial context. No governance gaps between systems maintained by different teams.

Context and execution in one endpoint

The architectural solution is to unify context and execution in a single platform. When the same system that executes queries also manages metadata, governance enforcement is inherent. A PII classification in the catalog directly affects what the query engine returns. An ownership change immediately updates access controls. A deprecation warning appears in every query response.

This unification eliminates multi-hop workflows. One tool call returns results with full business context. No reconciliation between catalog names and query engine names. No partial context from timed-out secondary calls. No governance gaps between systems maintained by different teams.

The integration tax of stitching together five vendors is eliminated because there is nothing to stitch. The catalog, query engine, governance layer, semantic enrichment, and agent framework are layers of one platform, not products from five companies.