view article Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other • 15 days ago • 51
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Paper • 2505.15966 • Published 21 days ago • 51
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana • 17 days ago • 44
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation Paper • 2505.14640 • Published 22 days ago • 14
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 22 days ago • 148
view article Article Microsoft and Hugging Face expand collaboration By jeffboudier and 2 others • 24 days ago • 21
view article Article TinyAgents: A Minimal Experiment with Code Agents and MCP Tools By albertvillanova • 27 days ago • 29
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 28 days ago • 113
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 431
view article Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? By danaaubakirova and 6 others • May 11 • 59
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control By danaaubakirova and 3 others • Feb 4 • 162
view article Article Gotchas in Tokenizer Behavior Every Developer Should Know By qgallouedec • Apr 18 • 37