From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models Paper • 2506.09930 • Published Jun 11 • 8
SAFE: Multitask Failure Detection for Vision-Language-Action Models Paper • 2506.09937 • Published Jun 11 • 9
Hidden in plain sight: VLMs overlook their visual representations Paper • 2506.08008 • Published Jun 9 • 8
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation Paper • 2506.18088 • Published 26 days ago • 17