Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nomadrp
/
dpo-v1.1
like
0
Transformers
Safetensors
Generated from Trainer
trl
dpo
arxiv:
2305.18290
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
dpo-v1.1
Commit History
dpo-v1.1
e4681a9
verified
nomadrp
commited on
22 days ago
Training in progress, step 100
d2b0d3b
verified
nomadrp
commited on
22 days ago
initial commit
194c840
verified
nomadrp
commited on
22 days ago