ai-progress-charts / models.jsonl

Commit History

Add Humanity's Last Exam, LiveBench and LiveCodeBench; Remove Codeforces; Update Simple Bench
b605a32

kaizuberbuehler commited on

Add new benchmarks; Several improvements
afb8d0c

kaizuberbuehler commited on

Update data of ARC-AGI and Simple Bench; Add Codeforces and PlanBench
03738e4

kaizuberbuehler commited on

Add data for ARC-AGI and Simple Bench
9ac5371

kaizuberbuehler commited on