Multi-Agent Trust Architecture
Implicit Trust vs. Zero-Trust MAS · Attack Surface Analysis · AgentFence Taxonomy
Implicit Trust (Typical Production)
Global shared state · No output sanitization · Ambient authority · No identity binding
Zero-Trust MAS (ZTMAS)
Explicit trust levels · Schema-gated comms · Task-scoped tools · Guardian validation
Failure Modes · Architecture A
Executor retains full tool scope regardless of task. Compromised peer triggers high-side actions without permission check.
Raw MCP and RAG outputs written directly to shared state. Adversarial content steers downstream agent planning.
Data plane crosses into control plane with no architectural barrier. Low-trust content issued as high-trust directive.
VSCode Copilot auto-initiates Dynamic Client Registration on HTTP errors. Social engineering surface and OAuth callback capture.
Controls Applied · Architecture B
Explicit L0/L1/L2 trust levels per agent. Control plane separated from data plane at the architectural level.
All tool outputs schema-validated as NonInstructionalData before context injection. Pydantic enforcement at every external boundary.
Hardened validator inspects all A2A payloads for Instructional Drift and Wrong-Principal Actions before execution.
Minimum necessary tool access per task. Access revoked on completion. Blast radius isolated per node.