reciprocate
·
AI & ML interests
Reward models
Organizations
reciprocate/kaggle-lmarena-synth-50k
Viewer
•
Updated
•
50.7k
•
33
reciprocate/ultra-annotated-200k
Viewer
•
Updated
•
208k
•
48
reciprocate/dpo-objective-v0.2
Viewer
•
Updated
•
384
•
45
reciprocate/tinygsm_interpreter_1M
Viewer
•
Updated
•
1M
•
32
Viewer
•
Updated
•
541
•
33
reciprocate/dpo_mix-zero-math-untoxic
Viewer
•
Updated
•
6.91k
•
39
reciprocate/dpo_mix-7k_untoxic
Viewer
•
Updated
•
7.29k
•
42
•
2
reciprocate/tinygsm_mixtral_12M
Viewer
•
Updated
•
12M
•
148
•
1
reciprocate/dpo_ultra-capybara-code_filtered-best
Viewer
•
Updated
•
35.2k
•
40
•
1
Viewer
•
Updated
•
6.17k
•
37
•
2
reciprocate/dpo_ultra-capybara_filtered-best
Viewer
•
Updated
•
25.6k
•
31
reciprocate/tinygsm_mixtral_up_dedup
Viewer
•
Updated
•
1.68M
•
27
reciprocate/ultrafeedback_orca_math_cleaned_high_dpo
Viewer
•
Updated
•
48.3k
•
65
•
2
reciprocate/ultrafeedback_cleaned_high_dpo
Viewer
•
Updated
•
40k
•
35
•
2
reciprocate/ultrafeedback_orca_math_dpo
Viewer
•
Updated
•
73.8k
•
45
•
2
reciprocate/ultrafeedback_cleaned_v2_dpo
Viewer
•
Updated
•
58.6k
•
46
•
1
reciprocate/math_dpo_pairs
Viewer
•
Updated
•
4.38k
•
56
•
5
reciprocate/pku_safer_dpo_pairs
Viewer
•
Updated
•
51.8k
•
33
reciprocate/pku_better_dpo_pairs
Viewer
•
Updated
•
330k
•
32
reciprocate/orca_dpo_pairs
Viewer
•
Updated
•
14.8k
•
31
Viewer
•
Updated
•
1k
•
34
reciprocate/gsm8k-test_critiques
Viewer
•
Updated
•
753
•
32
•
2
reciprocate/gsm8k_train_pairwise
Viewer
•
Updated
•
7.04k
•
190
•
3
reciprocate/gsm8k_pairwise
Viewer
•
Updated
•
128
•
27
•
2
Viewer
•
Updated
•
13k
•
30
Viewer
•
Updated
•
10.5k
•
87
•
1
Viewer
•
Updated
•
2.37k
•
30
reciprocate/vicuna-fair-eval_format-oa
Viewer
•
Updated
•
66
•
29
reciprocate/vicuna-fair-eval
Viewer
•
Updated
•
66
•
35
reciprocate/vicuna_fair_eval_dataset
Viewer
•
Updated
•
66
•
33