Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Rexhaif
/
Qwen2.5-1.5B-MT-Eval-Reasoner
like
0
Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
axolotl
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Deploy
Use this model
a3b39c6
Qwen2.5-1.5B-MT-Eval-Reasoner
Commit History
Training in progress, step 1000
a3b39c6
verified
Rexhaif
commited on
Apr 17
Training in progress, step 4000
5e7fff5
verified
Rexhaif
commited on
Apr 16
Training in progress, step 3000
804d4ad
verified
Rexhaif
commited on
Apr 16
Training in progress, step 2000
f43b8c6
verified
Rexhaif
commited on
Apr 16
Training in progress, step 1000
46e68aa
verified
Rexhaif
commited on
Apr 16
Training in progress, step 1500
e3c0d69
verified
Rexhaif
commited on
Apr 15
Training in progress, step 1000
27053c7
verified
Rexhaif
commited on
Apr 15
Training in progress, step 500
1b54b03
verified
Rexhaif
commited on
Apr 15
Training in progress, step 1000
3e04a7f
verified
Rexhaif
commited on
Apr 15
Training in progress, step 500
bab9491
verified
Rexhaif
commited on
Apr 15
initial commit
9d63a1d
verified
Rexhaif
commited on
Apr 14