OpenEvals

community
Activity Feed

AI & ML interests

LLM evaluation

Recent Activity

thomwolfΒ  authored a paper 6 days ago
Robot Learning: A Tutorial
clefourrierΒ  updated a Space 11 days ago
OpenEvals/EvalsOnTheHub
clefourrierΒ  published a Space 12 days ago
OpenEvals/EvalsOnTheHub
View all activity

Articles

OpenEvals 's collections 5

Research collaborations
A small overview of our research collabs through the years
Archived Open LLM Leaderboard (2024-2025)
This leaderboard has been evaluating LLMs from Jun 2024 on IFEval, MuSR, GPQA, MATH, BBH and MMLU-Pro