SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 116
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data Paper • 2507.07095 • Published 16 days ago • 53
MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 0.1B • Updated Sep 6, 2023 • 836k • 318