Agent Session Smuggling: Hidden Instructions Across A2A Agent Sessions
Unit 42 describes Agent Session Smuggling, a research attack pattern where a malicious remote A2A agent uses an established stateful session to inject hidden instructions into a victim agent. In the proof of concept, a financial assistant delegates market-news research to a malicious research assistant, which then leaks internal context and triggers an unauthorized stock purchase.
The lesson is that cross-agent session state is not trust. Remote-agent messages need identity, delegation lineage, task scope, data provenance, and high-impact action gates.
Threat Analysis
- The user starts with a normal delegated task. A financial assistant asks a remote research assistant for market news, creating a stateful A2A session.
- The remote agent turns that session into a covert instruction channel. Between the user's request and the expected answer, the malicious research assistant sends extra instructions that are not visible to the user.
- The first proof of concept leaks internal context. The remote agent induces the financial assistant to reveal chat history, system instructions, available tools, and tool parameter schemas.
- The second proof of concept crosses into action. After learning the tool surface, the malicious remote agent causes the financial assistant to call
buy_stockand purchase 10 shares without user authorization. - A2A itself is not the disclosed vulnerability. The risk comes from implicit trust in a stateful cross-agent channel, where a remote agent response may inherit client-agent authority just because it arrives inside a valid delegated session.
Applicable AIDEFEND Defenses (9)
buy_stock, sell_stock, profile access, or portfolio-management tools into the reachable flow.What Defenders Should Do Now
- Inventory every A2A or multi-agent integration where one agent can delegate work to a remote agent, especially when the client agent has financial, administrative, code-execution, memory-write, or data-export tools.
- Require signed remote-agent identity and preserve delegation lineage through every message. Record the original user intent, the client agent, the remote agent, the delegated task, the session ID, and the allowed capability scope.
- Build a narrow authority envelope for each delegated session. A research delegation should not inherit trading, account-management, secrets, system-prompt, or tool-discovery authority from the client agent.
- Gate high-impact actions through an independent confirmation path. Do not let a remote agent's message, tool output, or conversation state become the only approval evidence for a trade, payment, destructive action, or sensitive-data disclosure.
- Tag remote-agent content as untrusted data and enforce data-flow policy at sinks. Block system instructions, chat history, user profiles, tool parameter schemas, credentials, and other sensitive values from leaving through the remote-agent channel unless policy explicitly permits it.
- Add stateful tests and monitoring for session drift. A test remote agent should try to move from the original task into tool discovery, hidden-context requests, and unauthorized actions so the team can verify that the chain fails before production users see it.
Conclusion
Agent Session Smuggling is useful because it names a failure mode that many multi-agent products will otherwise treat as normal delegation. The risky step is not only what the remote agent says; it is that the client agent may treat that message as trusted session context and carry it into privileged tools. AIDEFEND maps the practical defense to identity, delegation lineage, scoped authority, continuous authorization, two-channel validation, value-level data-flow control, trust-state demotion, stateful monitoring, and repeatable adversarial tests.