LM Preference datas
updated
Viewer
• Updated • 183k • 1.38k
• 295
mlabonne/chatml_dpo_pairs
Viewer
• Updated • 12.9k • 97
• 55
HuggingFaceH4/ultrachat_200k
Viewer
• Updated • 515k • 69.5k
• 719
Viewer
• Updated • 12.9k • 1.72k
• 322
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
• Updated • 60.9k • 14.9k
• 162
argilla/distilabel-math-preference-dpo
Viewer
• Updated • 2.42k • 3.83k
• 88
PKU-Alignment/PKU-SafeRLHF
Viewer
• Updated • 164k • 14k
• 187
lvwerra/stack-exchange-paired
Viewer
• Updated • 31.3M • 2.14k
• 150
Viewer
• Updated • 169k • 39.1k
• 1.75k
jondurbin/truthy-dpo-v0.1
Viewer
• Updated • 1.02k • 832
• 136
Viewer
• Updated • 2.02k • 79
• 15
Viewer
• Updated • 445k • 1.15k
• 103
Viewer
• Updated • 37.1k • 2.14k
• 248
Viewer
• Updated • 7.5k • 1.16k
• 173
Viewer
• Updated • 1.11M • 23.1k
• 242
openbmb/UltraInteract_sft
Viewer
• Updated • 289k • 1.12k
• 127
allenai/olmo-2-0325-32b-preference-mix
Updated • 173
• 15
PrimeIntellect/SYNTHETIC-2-Base-Answer-Critique
Viewer
• Updated • 50k • 11
• 2