Pinned Space (Running): The Ultra-Scale Playbook 🌌. The ultimate guide to training LLMs on large GPU clusters
weege007/Qwen2.5-1.5B-Instruct_grpo_Countdown-Tasks-3to4 Text Generation • 2B • Updated 20 days ago • 13
weege007/Qwen2.5-0.5B-Instruct_grpo_Countdown-Tasks-3to4 Text Generation • 0.5B • Updated 20 days ago • 14
weege007/llama-3-8b-bnb-4bit-alpaca-merged-16bit Text Generation • 8B • Updated Apr 22, 2024 • 115