view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data By danaaubakirova and 8 others • Jun 3 • 214
Running 15 15 Leaderboard: Physical Reasoning from Video 🏃 Submit and score model predictions for video and text tasks