deepseek-ai/DeepSeek-R1-Distill-Qwen-7B Text Generation • Updated about 1 month ago • 1.35M • 569
Running • 116 • Open-LLM performances are plateauing, let’s make the leaderboard steep again • Update leaderboard for fair model evaluation
Running • 895 • FineWeb: decanting the web for the finest text data at scale 🍷 • Generate high-quality web text data for LLM training