Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
meixiang123
/
zephyr-7b-dpo-lora
like
0
Transformers
Safetensors
Generated from Trainer
dpo
trl
arxiv:
2305.18290
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
zephyr-7b-dpo-lora
Commit History
Training in progress, step 346
a0c85a2
verified
meixiang123
commited on
21 days ago
Training in progress, step 300
bf30f03
verified
meixiang123
commited on
21 days ago
Training in progress, step 200
91845fb
verified
meixiang123
commited on
22 days ago
Training in progress, step 100
efe6a57
verified
meixiang123
commited on
22 days ago
Training in progress, step 1
e098d3a
verified
meixiang123
commited on
22 days ago
Upload model
dc18c5d
verified
meixiang123
commited on
23 days ago
Upload tokenizer
0f4c38d
verified
meixiang123
commited on
24 days ago
Upload model
1de527d
verified
meixiang123
commited on
24 days ago
initial commit
508f04f
verified
meixiang123
commited on
24 days ago