Running on CPU Upgrade • 250 • GPT-OSS-120B on AMD MI300X 💻 gpt-oss-120b model running on AMD MI300 infrastructure.
Running • 3.11k • The Ultra-Scale Playbook 🌌 The ultimate guide to training LLMs on large GPU clusters
Article • Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 654
Running • 50 • Stick To Your Role! Leaderboard 🎭 Benchmarking LLMs on the stability of simulated populations
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 69