Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
chloeli
/
qwen-2.5-0.5B-instruct-sft-lora-countdown-search-1k
like
0
Text Generation
Transformers
Safetensors
MelinaLaimon/stream-of-search
qwen2
Generated from Trainer
alignment-handbook
trl
sft
conversational
text-generation-inference
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
main
qwen-2.5-0.5B-instruct-sft-lora-countdown-search-1k
Commit History
End of training
d44dfb4
verified
chloeli
commited on
Mar 29
Model save
28c454a
verified
chloeli
commited on
Mar 29
Training in progress, step 125
a7762a5
verified
chloeli
commited on
Mar 29
Training in progress, step 100
6e9dab0
verified
chloeli
commited on
Mar 29
End of training
771f4e8
verified
chloeli
commited on
Mar 27
Model save
620900d
verified
chloeli
commited on
Mar 27
Training in progress, step 125
7023c32
verified
chloeli
commited on
Mar 27
Training in progress, step 100
9758a88
verified
chloeli
commited on
Mar 27
initial commit
2ec18c5
verified
chloeli
commited on
Mar 27