A Survey on Vision-Language-Action Models: An Action Tokenization Perspective Paper • 2507.01925 • Published Jul 2 • 36
Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning Paper • 2507.16746 • Published 24 days ago • 33
MolmoAct: Action Reasoning Models that can Reason in Space Paper • 2508.07917 • Published 5 days ago • 35