VLM - a RzZ Collection

VLM

updated 1 day ago

UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces

Paper • 2312.15715 • Published Dec 25, 2023 • 21

Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Paper • 2505.23747 • Published May 29, 2025 • 67

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 34

Scaling RL to Long Videos

Paper • 2507.07966 • Published 2 days ago • 103