Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Kyleyee
/
Qwen2.5-1.5B-reward-hh-retrain
like
0
Text Classification
Transformers
Safetensors
Kyleyee/train_data_Helpful_implicit_prompt
qwen2
Generated from Trainer
trl
reward-trainer
text-generation-inference
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
main
Qwen2.5-1.5B-reward-hh-retrain
Commit History
End of training
7fed051
verified
Kyleyee
commited on
Apr 24
Model save
a5e5185
verified
Kyleyee
commited on
Apr 24
Model save
02c07cc
verified
Kyleyee
commited on
Apr 24
Training in progress, epoch 2
873b807
verified
Kyleyee
commited on
Apr 24
Training in progress, epoch 2
59c0d84
verified
Kyleyee
commited on
Apr 24
Training in progress, epoch 1
f70cf09
verified
Kyleyee
commited on
Apr 24
initial commit
de5c9cd
verified
Kyleyee
commited on
Apr 24