Toolkits (DUPLICATE them, never use the public ones)

A set of tools to enable finetuning, evaluations, prototyping, agentic workflows, etc. ATTENTION: ALWAYS DUPLICATE THESE SPACES ON OUR INFRA!

- AutoTrain Advanced 🚀: Create powerful AI models without code. (Running, 105 likes)
- LLM Merge Adapter 🐢 (Runtime error, 39 likes)
- mergekit-gui 🔀: Merge AI models using a YAML configuration file. (Sleeping, 284 likes)
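
Since the toolkit Spaces are meant to be duplicated rather than used in their public form, the duplication can be scripted. The following is a minimal sketch, assuming the `huggingface_hub` Python client is installed and a token with write access to the target organization is configured. The helper names `target_space_id` and `duplicate_to_org`, and the IDs in the usage note, are hypothetical placeholders, because the listing above gives Space titles only, not their repo IDs.

```python
def target_space_id(source_id: str, org: str) -> str:
    """Map 'owner/name' to 'org/name': keep the Space name, swap the owner."""
    owner, sep, name = source_id.partition("/")
    if not sep or not owner or not name:
        raise ValueError(f"expected 'owner/name', got {source_id!r}")
    return f"{org}/{name}"


def duplicate_to_org(source_id: str, org: str, private: bool = True):
    """Duplicate a public Space into `org` (hypothetical helper).

    Requires network access and a Hugging Face token with write rights.
    """
    # Imported lazily so target_space_id() stays usable without the library.
    from huggingface_hub import duplicate_space

    return duplicate_space(
        from_id=source_id,
        to_id=target_space_id(source_id, org),
        private=private,   # keep the copy private by default
        exist_ok=True,     # do not fail if the copy already exists
    )
```

A call such as `duplicate_to_org("some-owner/some-space", "our-org")` would create `our-org/some-space`; both IDs are placeholders to be replaced with the real source Space and organization.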

Benchmarks

Most commonly used leaderboards to check model capabilities.

- Open LLM Leaderboard 🏆: Track, rank and evaluate open LLMs and chatbots. (Running on CPU Upgrade, 13.4k likes)
- LLM Performance Leaderboard 🐨: View LLM performance rankings. (Running, 359 likes)
- LMArena Leaderboard 🏆: Display LMArena Leaderboard. (Running, 4.56k likes)
- MTEB Leaderboard 🥇: Embedding Leaderboard. (Running on CPU Upgrade, 6.16k likes)