sunblaze-ucb
/

Qwen3-14B-Intuitor-MATH-1EPOCH

Text Generation

Model card Files Files and versions Community

Qwen3-14B-Intuitor-MATH-1EPOCH

Description:

An Intuitor-fine-tuned version of Qwen3-14B trained on the MATH dataset.

Citation

@article{zhao2025learning,
  title   = {Learning to Reason without External Rewards},
  author  = {Zhao, Xuandong and Kang, Zhewei and Feng, Aosong and Levine, Sergey and Song, Dawn},
  journal = {arXiv preprint arXiv:2505.19590},
  year    = {2025}
}

Downloads last month: 56

Safetensors

Model size

14.8B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for sunblaze-ucb/Qwen3-14B-Intuitor-MATH-1EPOCH

Base model

Qwen/Qwen3-14B-Base

Finetuned

Finetuned

(81)

this model

Collection including sunblaze-ucb/Qwen3-14B-Intuitor-MATH-1EPOCH

Intuitor

Models in the paper "Learning to Reason without External Rewards" • 12 items • Updated Jun 25