reciprocate
·
AI & ML interests
Reward models
Organizations
reciprocate/kaggle-lmarena-synth-50k
Viewer
•
Updated
•
50.7k
•
19
reciprocate/ultra-annotated-200k
Viewer
•
Updated
•
208k
•
27
reciprocate/dpo-objective-v0.2
Viewer
•
Updated
•
384
•
20
reciprocate/tinygsm_interpreter_1M
Viewer
•
Updated
•
1M
•
34
Viewer
•
Updated
•
541
•
15
reciprocate/dpo_mix-zero-math-untoxic
Viewer
•
Updated
•
6.91k
•
21
reciprocate/dpo_mix-7k_untoxic
Viewer
•
Updated
•
7.29k
•
24
•
2
reciprocate/tinygsm_mixtral_12M
Viewer
•
Updated
•
12M
•
100
•
1
reciprocate/dpo_ultra-capybara-code_filtered-best
Viewer
•
Updated
•
35.2k
•
22
•
1
Viewer
•
Updated
•
6.17k
•
22
•
2
reciprocate/dpo_ultra-capybara_filtered-best
Viewer
•
Updated
•
25.6k
•
13
reciprocate/tinygsm_mixtral_up_dedup
Viewer
•
Updated
•
1.68M
•
11
reciprocate/ultrafeedback_orca_math_cleaned_high_dpo
Viewer
•
Updated
•
48.3k
•
22
•
2
reciprocate/ultrafeedback_cleaned_high_dpo
Viewer
•
Updated
•
40k
•
16
•
2
reciprocate/ultrafeedback_orca_math_dpo
Viewer
•
Updated
•
73.8k
•
23
•
2
reciprocate/ultrafeedback_cleaned_v2_dpo
Viewer
•
Updated
•
58.6k
•
26
•
1
reciprocate/math_dpo_pairs
Viewer
•
Updated
•
4.38k
•
69
•
5
reciprocate/pku_safer_dpo_pairs
Viewer
•
Updated
•
51.8k
•
16
reciprocate/pku_better_dpo_pairs
Viewer
•
Updated
•
330k
•
16
reciprocate/orca_dpo_pairs
Viewer
•
Updated
•
14.8k
•
17
Viewer
•
Updated
•
1k
•
15
reciprocate/gsm8k-test_critiques
Viewer
•
Updated
•
753
•
17
•
2
reciprocate/gsm8k_train_pairwise
Viewer
•
Updated
•
7.04k
•
140
•
3
reciprocate/gsm8k_pairwise
Viewer
•
Updated
•
128
•
16
•
2
Viewer
•
Updated
•
13k
•
17
Viewer
•
Updated
•
10.5k
•
52
•
1
Viewer
•
Updated
•
2.37k
•
15
reciprocate/vicuna-fair-eval_format-oa
Viewer
•
Updated
•
66
•
12
reciprocate/vicuna-fair-eval
Viewer
•
Updated
•
66
•
15
reciprocate/vicuna_fair_eval_dataset
Viewer
•
Updated
•
66
•
15