Running on CPU Upgrade 68 68 AIR-Bench Leaderboard π₯ Explore benchmark results for QA and long doc models
Running on CPU Upgrade 113 113 Open Chinese LLM Leaderboard π Display and filter LLM benchmark results
Running on CPU Upgrade 12.8k 12.8k Open LLM Leaderboard π Track, rank and evaluate open LLMs and chatbots