MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs Paper • 2506.01674 • Published Jun 2 • 28
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation Paper • 2407.02371 • Published Jul 2, 2024 • 55
Learning Referring Video Object Segmentation from Weak Annotation Paper • 2308.02162 • Published Aug 4, 2023
Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation Paper • 2309.11160 • Published Sep 20, 2023