A OVD ehanched Qwen 2.5VL 3B with VLM-R1 reinforcement learning.
cite: arxiv.org/abs/2504.07615
Chat template
Files info
Base model