view article Article Common Pitfalls in Sharing Open Source Models on Hugging Face (and How to Dodge Them) By FriendliAI and 2 others • 4 days ago • 21
view article Article Bringing Fusion Down to Earth: ML for Stellarator Optimization By cgeorgiaw • 3 days ago • 55
Pretrained Transformers as Universal Computation Engines Paper • 2103.05247 • Published Mar 9, 2021 • 1
LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs Paper • 2506.21862 • Published 8 days ago • 32
Approximating Language Model Training Data from Weights Paper • 2506.15553 • Published 17 days ago • 1
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning Paper • 2410.21845 • Published Oct 29, 2024 • 16
Chain-of-Thought Reasoning is a Policy Improvement Operator Paper • 2309.08589 • Published Sep 15, 2023 • 2
view article Article Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm By nvidia and 4 others • 24 days ago • 65
Layer by Layer: Uncovering Hidden Representations in Language Models Paper • 2502.02013 • Published Feb 4 • 2
Autonomous Improvement of Instruction Following Skills via Foundation Models Paper • 2407.20635 • Published Jul 30, 2024 • 1
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning Paper • 2412.09858 • Published Dec 13, 2024 • 2
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning Paper • 2401.16013 • Published Jan 29, 2024 • 26
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents Paper • 2505.22954 • Published May 29 • 12
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 108
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana • May 26 • 44
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Paper • 2505.09343 • Published May 14 • 65