Qwen3-14B-Intuitor-MATH-1EPOCH

Description:

An Intuitor-fine-tuned version of Qwen3-14B trained on the MATH dataset.


Citation

@article{zhao2025learning,
  title   = {Learning to Reason without External Rewards},
  author  = {Zhao, Xuandong and Kang, Zhewei and Feng, Aosong and Levine, Sergey and Song, Dawn},
  journal = {arXiv preprint arXiv:2505.19590},
  year    = {2025}
}
Downloads last month
56
Safetensors
Model size
14.8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for sunblaze-ucb/Qwen3-14B-Intuitor-MATH-1EPOCH

Finetuned
Qwen/Qwen3-14B
Finetuned
(81)
this model

Collection including sunblaze-ucb/Qwen3-14B-Intuitor-MATH-1EPOCH