Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

lindafei001
/
llama-8b-instruct-medical-dpo-unlearn-10epochs-1e-5-64-128-2048

Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
dpo
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-8b-instruct-medical-dpo-unlearn-10epochs-1e-5-64-128-2048 / runs
28.3 kB
  • 1 contributor
History: 1 commit
lindafei001's picture
lindafei001
Upload trained model checkpoint
afa794a verified 16 days ago
  • Sep22_16-41-12_w-yanli-sudolm-88776406705d4aceac34e7a04afadc53-85df665cbcpx44f
    Upload trained model checkpoint 16 days ago
  • Sep22_17-35-27_w-yanli-sudolm-88776406705d4aceac34e7a04afadc53-85df665cbcpx44f
    Upload trained model checkpoint 16 days ago