Google DeepMind's Aletheia: The AI Agent That Solves Real Mathematical Research
Winning a math olympiad is impressive. Doing original mathematical research — navigating thousands of papers, formulating novel conjectures, constructing long-horizon proofs — is something else entirely. Until recently, that gap separated AI benchmarks from actual scientific contribution. Google DeepMind’s Aletheia is the first system to credibly cross it. Deployed in December 2025 against a database of 700 open mathematical problems, Aletheia autonomously solved four open Erdős problems — longstanding conjectures that professional mathematicians had left unresolved. It didn’t just compute answers; it generated, verified, and revised proofs in natural language, with results that have contributed to peer-reviewed publications. ...