Running 1.24k 1.24k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
cognitivecomputations/Dolphin3.0-R1-Mistral-24B Text Generation • Updated 7 days ago • 3.27k • 138
yentinglin/Mistral-Small-24B-Instruct-2501-reasoning Text Generation • Updated 2 days ago • 845 • 42
NousResearch/DeepHermes-3-Llama-3-8B-Preview Text Generation • Updated 3 days ago • 5.38k • 253
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper • 2502.06060 • Published 12 days ago • 32
unsloth/Mistral-Small-24B-Instruct-2501-unsloth-bnb-4bit Text Generation • Updated 20 days ago • 16.9k • 12
unsloth/DeepSeek-R1-Distill-Qwen-32B-unsloth-bnb-4bit Text Generation • Updated 7 days ago • 3.36k • 9
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published Jan 21 • 51
RL + Transformer = A General-Purpose Problem Solver Paper • 2501.14176 • Published 29 days ago • 24
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer Paper • 2501.15570 • Published 27 days ago • 23