AI Coding Agents Introduce Vulnerabilities in 87% of Pull Requests Across Claude Code, Codex, and Gemini
The headline number is uncomfortable: 87%. That’s the share of pull requests containing at least one security vulnerability when AI coding agents — Claude Code, OpenAI Codex, and Google Gemini — were used to build real applications from scratch. That’s the finding from DryRun Security’s inaugural Agentic Coding Security Report, published this week and already making waves through security and developer communities. This isn’t a synthetic benchmark. DryRun tested three leading AI coding agents building two real applications each, generating approximately five pull requests per agent. The result: 143 total vulnerabilities documented across 30 pull requests. Nearly nine out of ten PRs had at least one problem. The two leading failure modes were access control gaps and improper token handling. ...