Running on CPU Upgrade 73 73 AIR-Bench Leaderboard π₯ Explore and compare QA and long doc benchmarks
Running on CPU Upgrade 119 119 Open Chinese LLM Leaderboard π Browse and submit models in an evaluation leaderboard