JayHyeon/Qwen_0.5-ultrainteract_BDPO_5e-7-1ep_0.9bdpo_lambda
Text Generation
Transformers
Safetensors
JayHyeon/trl_ultrainteract-pair
qwen2
Generated from Trainer
trl
dpo
conversational
text-generation-inference
arxiv:2305.18290
main
Commit History
End of training
0743357
verified
JayHyeon committed on Mar 31
Model save
ce4611d
verified
JayHyeon committed on Mar 31
Training in progress, step 3258
54f723b
verified
JayHyeon committed on Mar 31
Training in progress, step 3000
6c80e05
verified
JayHyeon committed on Mar 31
Training in progress, step 2500
d3f7d80
verified
JayHyeon committed on Mar 31
Training in progress, step 2000
70be1c5
verified
JayHyeon committed on Mar 31
Training in progress, step 1500
2ceb027
verified
JayHyeon committed on Mar 31
Training in progress, step 1000
576bf85
verified
JayHyeon committed on Mar 31
Training in progress, step 500
b176789
verified
JayHyeon committed on Mar 31
End of training
a81aa0a
verified
JayHyeon committed on Mar 31
Model save
9b84339
verified
JayHyeon committed on Mar 31
Training in progress, step 3258
85b0534
verified
JayHyeon committed on Mar 31
Training in progress, step 3000
a8b9916
verified
JayHyeon committed on Mar 31
Training in progress, step 2500
1e35c32
verified
JayHyeon committed on Mar 31
Training in progress, step 2000
37264af
verified
JayHyeon committed on Mar 31
Training in progress, step 1500
bebd086
verified
JayHyeon committed on Mar 31
Training in progress, step 1000
75ee297
verified
JayHyeon committed on Mar 31
Training in progress, step 500
2ba364b
verified
JayHyeon committed on Mar 31
Training in progress, step 500
7d3d728
verified
JayHyeon committed on Mar 31
initial commit
e872d30
verified
JayHyeon committed on Mar 31