Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

trl-lib
/
pythia-6.9b-deduped-tldr-online-dpo

TensorBoard
Safetensors
gpt_neox
Generated from Trainer
Model card Files Files and versions
xet
Metrics Training metrics Community
pythia-6.9b-deduped-tldr-online-dpo / runs
2.36 MB
  • 1 contributor
History: 1 commit
edbeeching's picture
edbeeching HF Staff
Add vwxyzjn/online_dpo_tldr_6.9b-main checkpoint
7f9cb36 verified about 1 year ago
  • Jul15_21-10-50_ip-26-0-162-79
    Add vwxyzjn/online_dpo_tldr_6.9b-main checkpoint about 1 year ago