Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
clembench-playpen 's Collections
SFT Final Models Merged
Datasets for DPO
KTO Final Models
OLD SFT Final Models Merged
SFT Final Models
Preference Dataset KTO (Wordle & Wordle_withclue)
Llama-3.2-3B
Llama-3.1-8B
Llama-3.2-1B

Datasets for DPO

updated 22 days ago

Collection of datasets for DPO for development. Data come from clembench v0.9 and v1.0 for all games, except for referencegame (v1.6).

Upvote
-

  • clembench-playpen/DPO_turn

    Viewer • Updated 22 days ago • 87.6k • 113

  • clembench-playpen/DPO_dialogue

    Viewer • Updated 22 days ago • 10.1k • 67
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs