mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-140
8B • Updated • 4
mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-190
8B • Updated • 8
mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-220
8B • Updated • 5
mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-250
8B • Updated • 8
mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-180
8B • Updated • 8
mlfoundations-cua-dev/Gelato-UI-Tars-1-5-7B
Updated
mlfoundations-cua-dev/qwen3_vl_30b_grpo-stage-1-on-103k-filtered-data-dynamic-sampling-partial-data
Updated
mlfoundations-cua-dev/qwen2_5vl_7b_110k_plus_agentnet_clicks_lr_1_0e-06_z3_4nodes
8B • Updated • 4
mlfoundations-cua-dev/grpo-7b-67k-filtered-data-5k-refusal-global-step-120
8B • Updated • 5
mlfoundations-cua-dev/grpo-7b-67k-filtered-data-5k-refusal-global-step-100
8B • Updated • 6
mlfoundations-cua-dev/grpo-7b-67k-filtered-data-5k-refusal-global-step-80
8B • Updated • 3
mlfoundations-cua-dev/grpo-7b-67k-filtered-data-5k-refusal-global-step-60
8B • Updated • 5
mlfoundations-cua-dev/grpo-7b-67k-filtered-data-5k-refusal-global-step-40
8B • Updated • 4
mlfoundations-cua-dev/grpo-7b-67k-filtered-data-5k-refusal-global-step-20
8B • Updated • 5
mlfoundations-cua-dev/uitars-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-255
8B • Updated • 8
mlfoundations-cua-dev/uitars-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-261
8B • Updated • 5
mlfoundations-cua-dev/uitars-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-264
8B • Updated • 8
mlfoundations-cua-dev/qwen-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-210
8B • Updated • 8
mlfoundations-cua-dev/uitars-7b-grpo-on-103k-filtered-data-dynamic-sampling-clip-high-step-273
8B • Updated • 9
mlfoundations-cua-dev/ui_tars_7b_easyr1_110k_4MP_jedi_ui_vision_gta1_data_lr_1_0e-06_z3_4nodes_full_epoch
8B • Updated • 5
mlfoundations-cua-dev/grpo-7b-stage-1-on-103k-filtered-data-dynamic-sampling-clip-high-no-pixmo-uground-seeclick
Updated
mlfoundations-cua-dev/ui_tars_7b_easyr1_110k_4MP_jedi_ui_vision_gta1_data_lr_1_0e-06_z3_4nodes_0_7_epoch
8B • Updated • 4
mlfoundations-cua-dev/grpo-7b-stage-1-on-103k-dense-reward-step-80
8B • Updated • 8
mlfoundations-cua-dev/qwen2_5vl_7b_model_soup_5x_uniform_add_110k_sft
8B • Updated • 10
mlfoundations-cua-dev/qwen2_5vl_7b_easyr1_110k_4MP_jedi_ui_vision_gta1_data_lr_1_0e-06_z3_4nodes
Image-to-Text • 8B • Updated • 7
mlfoundations-cua-dev/grpo-7b-stage-1-on-103k-dense-reward-step-160
8B • Updated • 9
mlfoundations-cua-dev/grpo-7b-stage-1-on-103k-dense-reward-step-140
8B • Updated • 9
mlfoundations-cua-dev/grpo-7b-stage-1-on-103k-dense-reward-step-120
8B • Updated • 9
mlfoundations-cua-dev/grpo-7b-stage-1-on-103k-dense-reward-step-100
8B • Updated • 8
mlfoundations-cua-dev/grpo-7b-stage-3-on-103k-filtered-data-temp-2-1-0p2-zero-only-show-ui-ui-vision-jedi-gta
8B • Updated • 9