Point C1

Capability Contracts for Agent Networks

Thesis

Agent systems should be organized around replaceable capabilities with explicit contracts, not around vague role prompts or artificial replicas of human organizations.

Human organizations contain useful coordination functions: planning, execution, review, escalation, policy, memory, specialization, and accountability. But copying the surface form of human organizations into AI systems often creates theater. A “manager agent” does not have human judgment. A “research intern agent” does not have employment incentives. A “legal department agent” does not carry professional liability.

The better move is to extract capabilities and give each capability a contract.

What Is A Capability?

A capability is a bounded function the system can call under a contract.

Examples:

  • retrieve relevant context
  • inspect a codebase
  • run tests
  • summarize a document
  • classify a clause
  • verify citations
  • compare alternative plans
  • enforce a policy rule
  • refresh stale context
  • produce a review packet
  • estimate risk
  • route to a human

A capability may be implemented by a model, a tool, an agent workflow, a deterministic service, a human process, or a hybrid. The implementation can change as models improve. The contract should remain legible.

Capability Contract

FieldQuestion
PurposeWhat does this capability do?
InputsWhat context or artifacts does it require?
OutputsWhat does it produce?
Allowed toolsWhat can it access?
Data boundaryWhat can it read, write, store, or send?
EvidenceWhat proof does it attach?
Failure modesHow does it fail or signal uncertainty?
Escalation routeWhere does unresolved control go?
TelemetryWhat outcome data improves the capability?
ReplaceabilityHow can another implementation substitute for it?

This looks like engineering discipline because it is. Agent systems become harder to reason about when capabilities are hidden inside prompts or personality roles.

Why Contracts Matter More Than Role Names

A role name says what a capability is supposed to be. A contract says how it can be trusted, replaced, audited, and constrained.

Weak role promptStronger capability contract
”Act as a senior reviewer.”Verify evidence against scope, cite failed checks, return confidence and unresolved risks.
”Be a project manager.”Compare active delegations, detect conflicts, propose next-best-control routes.
”Be a legal assistant.”Extract clauses, preserve references, apply rubric, avoid legal advice, require human review.
”Be a researcher.”Build claim map, rate source quality, surface counterarguments, mark freshness.

This is also how the network stays replaceable. The implementation can shift from one model to another, or from a model to a deterministic tool, if the contract remains stable.

Network, Not Org Chart

flowchart TB
    D["Delegation record"] --> R["Context retrieval"]
    D --> P["Planner"]
    P --> E["Executor"]
    E --> V["Verifier"]
    V --> A["Arbiter"]
    A --> E
    V --> PK["Policy gate"]
    PK --> H["Human/principal"]
    E --> M["Memory/update"]
    V --> D
    A --> D
    PK --> D

This graph is not an organization. It is a control network. Capabilities can be swapped, upgraded, or disabled. Some may be domain-specific. Some may become unnecessary as model intelligence improves. Some should remain separate because separation improves evidence, review, security, or accountability.

Why Domain-Specific Capabilities Matter

A code verifier and a legal verifier are not the same capability.

Coding verification can inspect diffs, run tests, check types, and compare against scope. Legal verification may need clause references, jurisdiction awareness, confidentiality rules, and human counsel review. Research verification needs source quality, counterarguments, freshness, and claim strength.

The network should therefore support both generic protocols and domain-specific capabilities.

Protocols Are Not Governance

Protocols and connectors make capabilities easier to call. Model Context Protocol and similar tool-connection patterns point toward a world where agents can access many tools and data sources through common interfaces.

That is useful, but interoperability is not governance. A protocol can say how to call a tool. It does not automatically decide whether the agent should call it, what data may be sent, what evidence must be retained, who reviews the result, or what happens when the tool output is wrong.

Capability networks need both connection and control.

Telemetry Without Surveillance

Capabilities should improve from outcome data. A verifier should learn which checks caught real defects. A router should learn which escalations were useful. A context refresher should learn when stale context caused failures.

But telemetry can become surveillance or retention risk. Useful telemetry should be scoped to capability improvement, stripped of unnecessary private content, and governed by retention rules. The system should learn from outcomes without preserving every private interaction forever.

Claim Support

ClaimSource supportConfidenceCaveat
Workflows and agents should start simple and use explicit patterns where possible.Anthropic, “Building effective agents.”MediumThis is a design guide, not a formal standard.
Tool/data interoperability does not itself solve security or governance.Model Context Protocol introduction and security best practices.MediumMCP is one protocol family, not the whole ecosystem.
Cognition and coordination can be distributed across people, artifacts, and environment.Hollan, Hutchins, and Kirsh on distributed cognition.MediumAnalogy must not imply agents have human institutional responsibility.

Practical Takeaway

When designing an agent system, replace title questions with capability questions.

Instead of asking:

  • Who is the manager agent?
  • Who is the researcher agent?
  • Who is the reviewer agent?

Ask:

  • What capability is needed?
  • What contract governs it?
  • What evidence does it return?
  • What control locus handles failure?
  • Can another implementation replace it?
  • What telemetry improves it safely?

Bridge To Article 7

Capability contracts are only meaningful when they reflect domain risk. The same delegation pattern behaves differently in coding, legal, finance, government, education, and research.

Sources

Agent Involvement

This article was prepared with AI assistance from a sanitized research discussion and public sources. The human maintainer approved this publication package on 2026-06-28. Treat the design primitives as exploratory proposals, not settled standards.

Sources Sources used 4 sources Show sources Hide sources

Look closer

Sources and notes

Open details Close details

These notes collect the sources, counterpoints, and review status behind the article's important points. Read the essay first; open this when you want to check something.

Confidence reflects how strongly the sources support the point (low / medium / high). Status describes the point's role (e.g., core, argument, landscape). Sources link to supporting material; counterpoints note boundary conditions or conflicting findings.

C001 medium argument

Agent systems should be organized around replaceable capabilities with explicit contracts, not around vague role prompts or artificial replicas of human organizations.

verified reviewed 2026-06-28

Sources (4)
Counterpoints (1)
  • The delegation record and next-best-control models are proposed design primitives, not accepted standards.

Review recordHow this was madeShow detailsHide details

Created 2026-06-28 by codex-agent. Policy: policy:default v1.0.0.

✓ Approved hash matches current article

Agent runs

  • drafting-and-site-previewgpt-52026-06-28in:5aa76f20…out:d62f207c…

Reviews

  • sibling-agentcommented2026-06-28

    Scope: series structure, evidence anchoring, privacy

    Earlier sibling review identified source-ledger gaps; those gaps were resolved before publication with article-specific source alignment.

  • humanapproved2026-06-28

    Scope: publication, reader experience, privacy, series structure

    contentHash: e2828555014eaf33…

    Human maintainer approved the local preview for website publication after article layout, reading-flow, privacy, and series-structure review.

  • sibling-agentapproved2026-06-28

    Scope: publication-gate, source alignment, provenance, privacy, generated artifacts

    contentHash: e2828555014eaf33…

    Independent sibling review approved the final publication packet with no blocking findings after source-ledger, provenance, privacy, and generated-artifact checks.