Running on Zero 62 62 VLM R1 Referral Expression 💬 Mark regions in images based on text descriptions
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Paper • 2504.07615 • Published 9 days ago • 26
Running on Zero 62 62 VLM R1 Referral Expression 💬 Mark regions in images based on text descriptions
omlab/Qwen2.5VL-3B-VLM-R1-REC-500steps Zero-Shot Object Detection • Updated 6 days ago • 801 • 22
omlab/Qwen2.5VL-3B-VLM-R1-REC-500steps Zero-Shot Object Detection • Updated 6 days ago • 801 • 22