Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

lindafei001
/
llama-8b-instruct-safeRLHF-dpo-10epochs-1e-5-64-128-0.5

Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
dpo
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-8b-instruct-safeRLHF-dpo-10epochs-1e-5-64-128-0.5
9.15 GB
  • 1 contributor
History: 2 commits
lindafei001's picture
lindafei001
Upload trained model checkpoint
07c1037 verified 20 days ago
  • checkpoint-169
    Upload trained model checkpoint 20 days ago
  • checkpoint-338
    Upload trained model checkpoint 20 days ago
  • checkpoint-507
    Upload trained model checkpoint 20 days ago
  • checkpoint-676
    Upload trained model checkpoint 20 days ago
  • checkpoint-845
    Upload trained model checkpoint 20 days ago
  • runs
    Upload trained model checkpoint 20 days ago
  • .gitattributes
    1.85 kB
    Upload trained model checkpoint 20 days ago
  • README.md
    2.48 kB
    Upload trained model checkpoint 20 days ago