The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 9 days ago • 36
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • 11 days ago • 49
view article Article Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H By Hcompany and 1 other • 11 days ago • 65
view article Article AutoThink: Adaptive Reasoning for Large Language Models By codelion • 18 days ago • 4
view article Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other • 17 days ago • 52
Step 1: Reproducing DeepSeek's Distilled Models Collection Code for training and evaluation: https://github.com/huggingface/open-r1 • 3 items • Updated 19 days ago • 2
view article Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance By tiiuae and 5 others • 24 days ago • 27
view article Article NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning By PranjaliJoshi and 1 other • 26 days ago • 24
view article Article TinyAgents: A Minimal Experiment with Code Agents and MCP Tools By albertvillanova • 29 days ago • 29
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • about 1 month ago • 113
INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning Paper • 2505.07291 • Published May 12 • 12
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning Paper • 2504.11354 • Published Apr 15 • 5
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate By muellerzr and 3 others • Jun 13, 2024 • 55
view article Article Empowering Public Organizations: Preparing Your Data for the AI Era By evijit and 1 other • Apr 10 • 16