MolmoWeb: Ai2's Open-Source Web Browser Agent Beats GPT-4o at Just 8 Billion Parameters
The Allen Institute for AI (Ai2) just dropped something the open-source AI community has been waiting for: a fully open, genuinely capable web browser agent that can go head-to-head with GPT-4o-based systems — at 8 billion parameters. It’s called MolmoWeb, and it’s available right now on Hugging Face under Apache 2.0. What MolmoWeb Actually Does MolmoWeb is a multimodal web agent. You give it a natural-language instruction, and it autonomously controls a real web browser: clicking, typing, scrolling, navigating, filling forms. It understands the web visually — through screenshots — rather than through structured DOM parsing. ...