lblaoke's Collections
Preference Data
Updated 15 days ago
Dahoas/full-hh-rlhf • Viewer • Updated Feb 23, 2023 • 125k • 1.51k • 82
HuggingFaceH4/ultrafeedback_binarized • Viewer • Updated Oct 16, 2024 • 187k • 14.4k • 296
PKU-Alignment/PKU-SafeRLHF • Viewer • Updated Oct 18, 2024 • 164k • 4.26k • 141
Skywork/Skywork-Reward-Preference-80K-v0.2 • Viewer • Updated Oct 25, 2024 • 77k • 970 • 51
nvidia/HelpSteer3 • Viewer • Updated 26 days ago • 99k • 2.31k • 49
allenai/reward-bench • Viewer • Updated Sep 9, 2024 • 8.11k • 7.3k • 95