ZyKINvice/Qwen2.5-0.5B-GRPO
openai/gsm8k
13 languages
License: mit
README.md exists but content is empty.
Model tree for ZyKINvice/Qwen2.5-0.5B-GRPO
Base model: Qwen/Qwen2.5-0.5B
Finetuned: Qwen/Qwen2.5-0.5B-Instruct
Finetuned (401): this model
Dataset used to train ZyKINvice/Qwen2.5-0.5B-GRPO: openai/gsm8k (updated Jan 4, 2024)