Running Agents 95 Nexus Function Calling Leaderboard π 95 Display benchmark results for models on various tasks
Running on CPU Upgrade Agents 609 GAIA Leaderboard π¦Ύ 609 Submit and view GAIA model evaluation leaderboard