mlfoundations-cua-dev/qwen2_5vl_7b_easyr1_110k_4MP_jedi_ui_vision_gta1_data_lr_1_0e-06_z3_4nodes Image-to-Text • 8B • Updated 1 day ago
mlfoundations-cua-dev/grpo-7b-stage-3-on-103k-filtered-data-temp-2-1-0p2-zero-only-show-ui-ui-vision-jedi-gta 8B • Updated 1 day ago • 3
mlfoundations-cua-dev/grpo-7b-stage-3-on-103k-filtered-data-temp-2-2-zero-correct-to-0.2-no-pixmo-uground-seeclick 8B • Updated 1 day ago • 5
mlfoundations-cua-dev/grpo-7b-stage-3-on-103k-filtered-data-temp-2-1-0p2-show-ui-ui-vision-jedi-gta-dense-reward 8B • Updated 1 day ago • 1
mlfoundations-cua-dev/easyr1-osworld-g-refined-eval-4MP-messages-training Viewer • Updated 1 day ago • 510
mlfoundations-cua-dev/easyr1-103k-4MP-not-all-correct-stage-one-temp-1_1-RL-remove-pixmo-uground-seeclick Viewer • Updated 3 days ago • 67.2k • 88
mlfoundations-cua-dev/easyr1-103k-4MP-stage-one-temp-1_1-RL-min-2-pass-max-7-pass Viewer • Updated 5 days ago • 51.1k • 40
mlfoundations-cua-dev/easyr1-103k-4MP-stage-three-temp-1_7-RL-only-ui-vision-jedi-show-ui-desktop-gta-0p0-zero Viewer • Updated 6 days ago • 6.03k • 58
mlfoundations-cua-dev/easyr1-103k-4MP-stage-three-temp-1_7-RL-ui-vision-jedi-show-ui-desktop-gta-dense-reward Viewer • Updated 7 days ago • 16.2k • 67
mlfoundations-cua-dev/easyr1-103k-4MP-stage-three-temp-1_7-RL-ui-vision-jedi-show-ui-desktop-gta-0p2-zero-dense-reward Viewer • Updated 7 days ago • 7.54k • 88