Running on CPU Upgrade Agents 26 Gaia2 Agents Evaluation Leaderboard 🐠 26 View and compare Gaia2 benchmark leaderboards for AI models
Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments Paper • 2602.11964 • Published Feb 12 • 13
Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments Paper • 2602.11964 • Published Feb 12 • 13
view article Article Gaia2 Leaderboard Update: New Models and New Observations meta-agents-research-environments • Oct 2, 2025 • 10
view article Article Gaia2 Leaderboard Update: New Models and New Observations meta-agents-research-environments • Oct 2, 2025 • 10
Running on CPU Upgrade Agents 26 Gaia2 Agents Evaluation Leaderboard 🐠 26 View and compare Gaia2 benchmark leaderboards for AI models
Running 48 Meta Agents Research Environments Demo 🚀 48 Explore Meta Agents research environments via web interface
view article Article Gaia2 and ARE: Empowering the community to study agents +9 clefourrier, gregmialz, mlcu, mortimerp9, XciD, tfrere, evijit, RomainFroger, dheeraj7596, CarolinePascal, upiter • Sep 22, 2025 • 134
view article Article Gaia2 and ARE: Empowering the community to study agents +9 clefourrier, gregmialz, mlcu, mortimerp9, XciD, tfrere, evijit, RomainFroger, dheeraj7596, CarolinePascal, upiter • Sep 22, 2025 • 134
Running 48 Meta Agents Research Environments Demo 🚀 48 Explore Meta Agents research environments via web interface