Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
sunblaze-ucb
/
Qwen2.5-3B-GRPO-MATH-1EPOCH
like
0
Follow
sunblaze-ucb
10
Text Generation
Safetensors
math
English
qwen2
conversational
arxiv:
2505.19590
arxiv:
2402.03300
License:
apache-2.0
Model card
Files
Files and versions
Community
main
Qwen2.5-3B-GRPO-MATH-1EPOCH
/
merges.txt
back-prop
Upload folder using huggingface_hub
f0c2736
verified
13 days ago
raw
Copy download link
history
contribute
delete
Safe
1.67 MB
File too large to display, you can
check the raw version
instead.