view article Article The GPT-OSS models are here… and they’re energy-efficient! By sasha • 5 days ago • 13
view article Article Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training By siro1 and 4 others • 4 days ago • 35
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 316
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 7 days ago • 437
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 5 days ago • 271
view article Article Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • 12 days ago • 61
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 14 days ago • 145
view article Article Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • Jul 10 • 47
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 618
view article Article cocogold: training Marigold for text-grounded segmentation By pcuenq • Jul 8 • 30
view article Article Selene 1 Mini: the best small language model-as-a-judge By AtlaAI and 10 others • Jan 29 • 13
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5 • 46