Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation Paper • 2306.13460 • Published Jun 23, 2023 • 1
Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective Paper • 2402.14545 • Published Feb 22, 2024
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement Paper • 2503.06520 • Published Mar 9 • 11
Unveiling Visual Biases in Audio-Visual Localization Benchmarks Paper • 2409.06709 • Published Aug 25, 2024
TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM Paper • 2503.13377 • Published Mar 17 • 3
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published May 12 • 82