Preference Data - a lblaoke Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

lblaoke 's Collections

Preference Data

Yifan's PPO Models

Preference Data

updated 15 days ago

Dahoas/full-hh-rlhf

Viewer • Updated Feb 23, 2023 • 125k • 1.51k • 82
HuggingFaceH4/ultrafeedback_binarized

Viewer • Updated Oct 16, 2024 • 187k • 14.4k • 296
PKU-Alignment/PKU-SafeRLHF

Viewer • Updated Oct 18, 2024 • 164k • 4.26k • 141
Skywork/Skywork-Reward-Preference-80K-v0.2

Viewer • Updated Oct 25, 2024 • 77k • 970 • 51
nvidia/HelpSteer3

Viewer • Updated 26 days ago • 99k • 2.31k • 49
allenai/reward-bench

Viewer • Updated Sep 9, 2024 • 8.11k • 7.3k • 95

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs