Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

lindafei001
/
llama-1b-instruct-medical-dpo-10epochs-1e-5-64-128-full

Transformers
TensorBoard
Safetensors
Generated from Trainer
dpo
trl
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-1b-instruct-medical-dpo-10epochs-1e-5-64-128-full
5.49 GB
  • 1 contributor
History: 3 commits
lindafei001's picture
lindafei001
Upload trained model checkpoint
483bd9c verified 15 days ago
  • checkpoint-188
    Upload trained model checkpoint 15 days ago
  • checkpoint-376
    Upload trained model checkpoint 15 days ago
  • checkpoint-564
    Upload trained model checkpoint 15 days ago
  • runs
    Upload trained model checkpoint 15 days ago
  • .gitattributes
    1.72 kB
    Upload trained model checkpoint 15 days ago
  • README.md
    2.48 kB
    Upload trained model checkpoint 15 days ago