From reactive to cognitive: brain-inspired spatial intelligence for embodied agents Paper • 2508.17198 • Published 15 days ago • 7
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use Paper • 2509.01055 • Published 8 days ago • 62
POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion Paper • 2509.01215 • Published 7 days ago • 45
OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning Paper • 2509.01644 • Published 7 days ago • 28
FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games Paper • 2509.01052 • Published 8 days ago • 19
ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association Paper • 2509.01584 • Published 7 days ago • 6
Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views Paper • 2509.01250 • Published 7 days ago • 2
Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots Paper • 2509.02530 • Published 6 days ago • 5
Drawing2CAD: Sequence-to-Sequence Learning for CAD Generation from Vector Drawings Paper • 2508.18733 • Published 13 days ago • 6