Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jasonhuang3
/
Pro6000-llama3-2-1b-instruct-dpo-lora-28k
like
0
Transformers
Safetensors
Generated from Trainer
trl
dpo
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
Pro6000-llama3-2-1b-instruct-dpo-lora-28k
Commit History
Model save
4715ede
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 7000
2470cfb
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 6500
3a92ed3
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 6000
6a019f4
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 5500
5400bb7
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 5000
fe965f4
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 4500
9f47182
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 4000
f61f3e8
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 3500
cd6dee4
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 3000
375696d
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 2500
e89c24a
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 2000
7efd76f
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 1500
572611e
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 1000
de6d3e6
verified
jasonhuang3
commited on
29 days ago
Training in progress, step 500
d11e4fc
verified
jasonhuang3
commited on
29 days ago
initial commit
7e16d5c
verified
jasonhuang3
commited on
29 days ago