VLM with GRPO training for vision-grounded decision making (https://arxiv.org/pdf/2503.16965)
Derek Zhe Hu
zhehuderek
AI & ML interests
NLP, Multimodality
Recent Activity
updated
a model
4 days ago
zhehuderek/qwen2_vl_7b_decisionmaking_4_50
published
a model
4 days ago
zhehuderek/qwen2_vl_7b_decisionmaking_4_50
updated
a model
4 days ago
zhehuderek/qwen2_vl_7b_decisionmaking_GEOQA_8K_R1V_55
Organizations
None yet