eipi1-0
's Collections
LM Preference datas
updated
Viewer
•
Updated
•
183k
•
586
•
290
mlabonne/chatml_dpo_pairs
Viewer
•
Updated
•
12.9k
•
36
•
52
HuggingFaceH4/ultrachat_200k
Viewer
•
Updated
•
515k
•
19.7k
•
542
Viewer
•
Updated
•
12.9k
•
2.34k
•
304
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
•
Updated
•
60.9k
•
2.59k
•
143
argilla/distilabel-math-preference-dpo
Viewer
•
Updated
•
2.42k
•
380
•
87
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
•
164k
•
4.16k
•
141
lvwerra/stack-exchange-paired
Viewer
•
Updated
•
31.3M
•
2.52k
•
144
Viewer
•
Updated
•
169k
•
10.8k
•
1.35k
jondurbin/truthy-dpo-v0.1
Viewer
•
Updated
•
1.02k
•
404
•
134
Viewer
•
Updated
•
2.02k
•
80
•
15
Viewer
•
Updated
•
445k
•
321
•
97
Viewer
•
Updated
•
37.1k
•
1.67k
•
237
Viewer
•
Updated
•
7.5k
•
539
•
166
Viewer
•
Updated
•
1.11M
•
5.74k
•
168
openbmb/UltraInteract_sft
Viewer
•
Updated
•
289k
•
289
•
121
allenai/olmo-2-0325-32b-preference-mix
Updated
•
136
•
12