view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data By danaaubakirova and 8 others โข Jun 3 โข 188
Running 14 14 Leaderboard: Physical Reasoning from Video ๐ Submit and score model predictions for video and text tasks