DryRun Security: Claude Generates More Unresolved Security Flaws Than Codex or Gemini in Real Apps
Anthropic has built its brand on safety. Claude is consistently positioned as the thoughtful, cautious model — the one that pushes back on dangerous requests, that thinks about consequences, that errs on the side of care. So the DryRun Security research published today will raise some eyebrows: when used as an agentic coding agent building real applications, Claude produces the highest number of unresolved high-severity security flaws among the leading AI coding agents tested. ...