Solo.io Open-Sources 'agentevals' at KubeCon — Continuous Scoring for Production AI Agents
Alongside Dapr Agents v1.0 and the CNCF AI Conformance Program updates, KubeCon Europe 2026 delivered a third piece of production AI agent infrastructure: agentevals, a new open-source project from Solo.io that brings continuous behavioral scoring to agent deployments. The problem agentevals addresses is deceptively simple to state and surprisingly hard to solve: how do you know if your production AI agent is still doing what it’s supposed to do?

What agentevals Does

Most AI agent evaluation today happens at development time: you run evals before deploying, decide the agent is good enough, and ship it. What happens after deployment is typically monitored through logs and user feedback, not through continuous automated assessment. ...