GAIA release - a gaia-benchmark Collection

Updated Nov 23, 2023

Gathers the items of the GAIA release.

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 218

Note: The arXiv paper (arxiv.org/abs/2311.12983) describing the benchmark and the dataset creation methodology.

GAIA Leaderboard

Submit and evaluate AI models on a leaderboard

Note: The leaderboard itself, with the scored models and information on how to submit a new model.

gaia-benchmark/GAIA

Updated Feb 13 • 11.6k • 361

Note: The dataset with the questions for the GAIA benchmark.

gaia-benchmark/results_public

Viewer • Updated about 9 hours ago • 347 • 3.46k • 15

Note: Open dataset of submission results.
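
The paper and the two datasets in this collection are identified only by short ids (an arXiv number and `owner/name` repository ids). As a minimal sketch, assuming the Hub's usual public URL layout (`/papers/<arxiv_id>` for paper pages and `/datasets/<repo_id>` for datasets), those ids can be turned into links as follows. The helper name `hub_url` and the `PREFIXES` table are hypothetical and not part of any library.

```python
HUB = "https://huggingface.co"

# Path prefix per item kind (assumption: the Hub's usual public URL layout).
PREFIXES = {"dataset": "datasets/", "paper": "papers/"}


def hub_url(kind: str, item_id: str) -> str:
    """Build a Hub URL for a collection item (hypothetical helper)."""
    if kind not in PREFIXES:
        raise ValueError(f"unsupported item kind: {kind!r}")
    return f"{HUB}/{PREFIXES[kind]}{item_id}"


# Ids exactly as they appear in the collection listing.
ITEMS = [
    ("paper", "2311.12983"),
    ("dataset", "gaia-benchmark/GAIA"),
    ("dataset", "gaia-benchmark/results_public"),
]

if __name__ == "__main__":
    for kind, item_id in ITEMS:
        print(hub_url(kind, item_id))
```

The leaderboard Space is left out because its repository id does not appear in the listing.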