SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation Paper • 2501.18564 • Published Jan 30 • 2
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing Paper • 2505.09990 • Published 10 days ago • 11