zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step80 8B • Updated 23 days ago • 42
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step160 8B • Updated 23 days ago • 32
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step240 8B • Updated 23 days ago • 31
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step304 8B • Updated 23 days ago • 30
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step256 8B • Updated 23 days ago • 177
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step144 8B • Updated 23 days ago • 104
zhangchenxu/Qwen2.5-VL-7B-Instruct-SFT-visualsphinx_10k_random-LR2.0e-5-EPOCHS3-LF Image-to-Text • 8B • Updated 25 days ago • 51
zhangchenxu/Qwen2.5-VL-7B-Instruct-SFT-visualsphinx_10k_reject-LR2.0e-5-EPOCHS3-LF Image-to-Text • 8B • Updated 25 days ago • 53
zhangchenxu/Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp12_nothink-GRPO-01_step256 8B • Updated May 14 • 15