A OVD ehanched Qwen 2.5VL 3B with VLM-R1 reinforcement learning.

cite: arxiv.org/abs/2504.07615

Downloads last month
646
Safetensors
Model size
3.75B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for omlab/VLM-R1-Qwen2.5VL-3B-OVD-0321

Finetuned
(115)
this model
Quantizations
2 models

Dataset used to train omlab/VLM-R1-Qwen2.5VL-3B-OVD-0321

Space using omlab/VLM-R1-Qwen2.5VL-3B-OVD-0321 1

Collection including omlab/VLM-R1-Qwen2.5VL-3B-OVD-0321