Article There is no such thing as a tokenizer-free lunch By catherinearnett • 2 days ago • 51
Article Gaia2 and ARE: Empowering the community to study agents By clefourrier and 10 others • 6 days ago • 91
Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers By ariG23498 and 6 others • 17 days ago • 149
Fantastic Pretraining Optimizers and Where to Find Them Paper • 2509.02046 • Published 25 days ago • 12
Article Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation By cbensimon and 3 others • 26 days ago • 63
Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • Aug 18 • 73
Article retrain-pipelines and the almighty function-caller By Aurelien-Morgan • Apr 28 • 8
Article The GPT-OSS models are here… and they’re energy-efficient! By sasha • Aug 7 • 19
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • Jul 29 • 179
MindJourney: Test-Time Scaling with World Models for Spatial Reasoning Paper • 2507.12508 • Published Jul 16 • 26
Article 5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub By fdaudens and 1 other • Jul 15 • 23