|
--- |
|
library_name: transformers |
|
tags: |
|
- generated_from_trainer |
|
datasets: |
|
- When-Does-Reasoning-Matter/general-reasoning-ift-pairs |
|
- When-Does-Reasoning-Matter/math-reasoning-ift-pairs |
|
language: |
|
- en |
|
pipeline_tag: text-generation |
|
--- |
|
|
|
# When Does Reasoning Matter? |
|
|
|
<p align="left"> |
|
<img src="https://cdn-avatars.huggingface.co/v1/production/uploads/62be186a5f59ff2320e6e32b/GjJ15tY7-F4bqR96FN4pd.png" alt="Dataset Icon" width="180"/> |
|
</p> |
|
|
|
<p align="left"> |
|
<a href="https://arxiv.org/pdf/2509.22193" target="_blank" rel="noopener noreferrer"> |
|
<img src="https://img.shields.io/badge/arXiv-2509.22193-b31b1b.svg?style=for-the-badge" alt="arXiv:2509.22193" /> |
|
</a> |
|
</p> |
|
|
|
|
|
This model was trained as part of the paper [When Does Reasoning Matter?](https://arxiv.org/pdf/2509.22193) |
|
It belongs to a collection of **General and Math-specific student models** distilled from Instruction-Fine-Tuned (IFT) or Reasoning answers generated by [Qwen/Qwen3-235B-A22B](https://huggingface.co/Qwen/Qwen3-235B-A22B). |
|
|
|
<img src="https://huggingface.co/api/resolve-cache/models/When-Does-Reasoning-Matter/Qwen2.5-0.5B-ift/733797fee2fdd300e1a0453d368250327fe4cc44/results.png?%2FWhen-Does-Reasoning-Matter%2FQwen2.5-0.5B-ift%2Fresolve%2Fmain%2Fresults.png=&etag=%22d36dedfbca764a8ac9a7a5ebc043ca53f5ee4966%22" alt="results" width="600"/> |
|
|
|
--- |
|
|
|
## Datasets |
|
|
|
These models were trained on the **largest set of IFT and Reasoning answer pairs**: |
|
- **General dataset**: [general-reasoning-ift-pairs](https://huggingface.co/datasets/When-Does-Reasoning-Matter/general-reasoning-ift-pairs) |
|
- **Math dataset**: [math-reasoning-ift-pairs](https://huggingface.co/datasets/When-Does-Reasoning-Matter/math-reasoning-ift-pairs) |
|
|
|
--- |
|
|
|
## Available Models |
|
|
|
<table> |
|
<thead> |
|
<tr> |
|
<th colspan="2">General</th> |
|
<th colspan="2">Math</th> |
|
</tr> |
|
<tr> |
|
<th>IFT Models</th> |
|
<th>Reasoning Models</th> |
|
<th>IFT Models</th> |
|
<th>Reasoning Models</th> |
|
</tr> |
|
</thead> |
|
<tbody> |
|
<tr> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-0.5B-ift">Qwen2.5-0.5B-ift</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-0.5B-reasoning">Qwen2.5-0.5B-reasoning</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-0.5B-math-ift">Qwen2.5-0.5B-math-ift</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-0.5B-math-reasoning">Qwen2.5-0.5B-math-reasoning</a></td> |
|
</tr> |
|
<tr> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-1.5B-ift">Qwen2.5-1.5B-ift</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-1.5B-reasoning">Qwen2.5-1.5B-reasoning</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-1.5B-math-ift">Qwen2.5-1.5B-math-ift</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-1.5B-math-reasoning">Qwen2.5-1.5B-math-reasoning</a></td> |
|
</tr> |
|
<tr> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-3B-ift">Qwen2.5-3B-ift</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-3B-reasoning">Qwen2.5-3B-reasoning</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-3B-math-ift">Qwen2.5-3B-math-ift</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-3B-math-reasoning">Qwen2.5-3B-math-reasoning</a></td> |
|
</tr> |
|
<tr> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-7B-ift">Qwen2.5-7B-ift</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-7B-reasoning">Qwen2.5-7B-reasoning</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-7B-math-ift">Qwen2.5-7B-math-ift</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-7B-math-reasoning">Qwen2.5-7B-math-reasoning</a></td> |
|
</tr> |
|
<tr> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-14B-ift">Qwen2.5-14B-ift</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-14B-reasoning">Qwen2.5-14B-reasoning</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-14B-math-ift">Qwen2.5-14B-math-ift</a></td> |
|
<td><a href="https://huggingface.co/When-Does-Reasoning-Matter/Qwen2.5-14B-math-reasoning">Qwen2.5-14B-math-reasoning</a></td> |
|
</tr> |
|
</tbody> |
|
</table> |
|
|
|
--- |
|
|
|
If you use this dataset in your work, please cite: **[When Does Reasoning Matter?](https://arxiv.org/pdf/2509.22193)** |
|
|
|
```bibtex |
|
@misc{boizard2025doesreasoningmattercontrolled, |
|
title={When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance}, |
|
author={Nicolas Boizard and Hippolyte Gisserot-Boukhlef and Kevin El-Haddad and Céline Hudelot and Pierre Colombo}, |
|
year={2025}, |
|
eprint={2509.22193}, |
|
archivePrefix={arXiv}, |
|
primaryClass={cs.CL}, |
|
url={https://arxiv.org/abs/2509.22193}, |
|
} |
|
``` |
|
|