WPRM
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
51

WPRM/qwen3-8b-ar-reward-cot-mtl-checklist-enhanced
8B
•
Updated
•
1

WPRM/qwen-3b-ar-reward-cot-mtl-checklist-enhanced
3B
•
Updated
•
1

WPRM/qwen3-8b-checklist-enhanced
8B
•
Updated
•
1

WPRM/qwen3-ar-reward-cot-mtl-same-ratio-epoch2
8B
•
Updated
•
1

WPRM/qwen3-ar-reward-cot-mtl
8B
•
Updated
•
1

WPRM/qwen3-ar-reward-cot-mtl-epoch1
8B
•
Updated
•
1

WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal_mtl
4B
•
Updated
•
1

WPRM/qwen2.5-ar-reward-cot-mtl
3B
•
Updated
•
922

WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal_final_new
4B
•
Updated
•
1

WPRM/qwen2.5-ar-reward-cot-final-new
3B
•
Updated
•
1
datasets
102
WPRM/ours_8b_mtl_enhanced_annotated_walite_combined_checklist
Viewer
•
Updated
•
812
•
21
WPRM/ours_3b_mtl_enhanced_annotated_walite_combined_checklist
Viewer
•
Updated
•
812
•
16
WPRM/ours_8b_enhanced_annotated_walite_combined_checklist
Viewer
•
Updated
•
812
•
12
WPRM/WebShepherd_train_multimodal_final_0513_checklist_only
Viewer
•
Updated
•
3.63k
•
16
WPRM/WebShepherd_train_text_only_final_0513_checklist_only
Viewer
•
Updated
•
3.63k
•
14
WPRM/WebShepherd_train_multimodal_final_0513
Viewer
•
Updated
•
43.2k
•
18
WPRM/WebShepherd_train_text_only_final_0513
Viewer
•
Updated
•
43.2k
•
17
WPRM/minibench-multimodal-mind2web
Viewer
•
Updated
•
4.96k
•
12
WPRM/ours_mtl3ratio_annotated_walite_combined_checklist_login2
Viewer
•
Updated
•
812
•
11
WPRM/WebShepherd_train_text_only_final_chosen_only
Viewer
•
Updated
•
10.9k
•
21