
When OWASP first published this report in July 2025, agentic AI security was framed as a portfolio of plausible threats. The report surveyed an early ecosystem, mapped likely failure modes, and called for governance to keep up.
A year later, the framing has shifted. The 2026 version is now live, and almost every threat described as plausible in the first paper, now has a CVE, vendor advisory, or production incident attached to it.
I joined the 2026 project and contributed to the Threat Landscape chapter. This post covers what changed in the threat picture, and what changed in the report itself.
1. The threats are operational - The first version reads like a risk assessment of an emerging surface. The 2026 paper reads like a postmortem catalogue. The Threat Analysis chapter places each pattern in the context of where agentic capability was expanding when the break happened, prompt injection scaling through indirect channels, supply chain compromise of MCP servers and skill registries, identity gaps in non-human identity, and sandboxes designed for human operators failing under agent execution. Every section is grounded in documented incidents. The companion Real-World Incidents and Exploits Tracker curates the public record and maps each event to the OWASP Top 10 for Agentic Applications.
2. Safety and security converge at the deployment layer. This is the most consequential framing change in the 2026 paper. Model-level safety remains the provider’s responsibility, but once an agent is acting on production systems, the same controls govern both kinds of harm and the same investigation surfaces both kinds of cause. The Replit production database deletion and the Cursor allowlist bypass have different cause stories: one had no adversary, the other required one, but the permission architecture that produced both is identical. The 2026 paper argues that organizations cannot continue to run AI Safety and AI Security as parallel functions.
3. Governance is being measured in hours. DORA’s four-hour notification, NIS2’s 24-hour early warning, NY RAISE’s 72-hour frontier reporting, and CA SB 53’s 15-day window all assume continuous oversight rather than periodic audit. The 2026 paper covers 42 regulatory instruments across 10 jurisdictions and traces what live monitoring, drift detection, automated incident routing, and agent-speed kill mechanisms actually require to operate.
The practical starting point in the 2026 paper is to identify the most advanced agents you are running today, then either raise governance maturity to match or reduce the deployment tier. Shadow AI deserves particular attention: it is present in nearly every organization contributors examined and must be discovered before it can be governed.
Read the full report: OWASP State of Agentic AI Security and Governance 2026

The security risks of AI agents are no longer theoretical. This blog examines the active threat landscape facing agentic AI in 2026, from prompt injection and supply chain attacks against MCP and skill registries to the governance gap created by vibe coding and Shadow AI.

Guardian agents are emerging as a critical security layer for the agentic AI era. As enterprises adopt AI agents that execute tools, handle sensitive data, and operate inside real workflows, human approval loops no longer scale. Guardian agents solve this by supervising other agents in real time: monitoring actions, enforcing policy, and blocking risky behavior before execution.
.png)
Capsule found two Cursor IDE vulnerabilities that let hidden prompt-injection instructions in referenced files steal developers’ SSH keys and contaminate future unrelated projects, causing zero-click or one-click exfiltration even when the attacker ships no malicious code.

Capsule Security’s State of AI Agent Security 2026 report is the largest independent audit of AI agents to date, showing that the ecosystem is rapidly shipping publicly exposed, weakly guarded, highly connected agents with recurring misconfigurations, near-absent runtime controls, widespread prompt-injection risk, expanding supply-chain exposure, and active malicious campaigns still propagating through agent skill and tool registries.

Capsule is launching a runtime security platform for the agentic AI era, built to monitor and stop autonomous agents that can bypass traditional guardrails, misuse legitimate access, and create a new class of enterprise security risk.

Capsule research team discover a critical prompt injection vulnerability in Salesforce Agentforce that allows attackers to exfiltrate CRM data through a simple lead from a form submission. No authentication required.