Mistral Small Reasoning GGUF

Original model: Mistral-Small-24B-Instruct-2501-reasoning

Model creator: yentinglin

This model is a fine-tuned version of mistralai/Mistral-Small-24B-Instruct-2501, optimized for mathematical reasoning tasks. It was fine-tuned on datasets including OpenR1-Math-220k and s1K-1.1 to strengthen its reasoning capabilities.

This repo contains GGUF format model files for Yen-Ting Lin’s Mistral Small Reasoning.

What is GGUF?

GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st 2023.

Converted with llama.cpp build 4735 (revision 73e2ed3), using autogguf-rs.
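
To check which GGUF version a downloaded file uses, you can read its header. The following is a minimal sketch, not part of this repo's tooling. It assumes a little-endian file in GGUF version 2 or later, where the 4-byte magic `GGUF` is followed by a 32-bit version and two 64-bit counts. The function name `read_gguf_header` is hypothetical.

```python
import struct


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) from a GGUF file.

    Assumes a little-endian GGUF v2+ header layout.
    """
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != b"GGUF":
            raise ValueError(f"not a GGUF file: magic={magic!r}")
        # uint32 format version
        (version,) = struct.unpack("<I", f.read(4))
        # uint64 tensor count, then uint64 metadata key/value count
        tensor_count, kv_count = struct.unpack("<QQ", f.read(16))
    return version, tensor_count, kv_count
```

Calling `read_gguf_header` on one of the files in this repo should report version 3 if the file matches the format version described above.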

Prompt template: Mistral Instruct (New)


[SYSTEM_PROMPT]{{system_message}}[/SYSTEM_PROMPT]

[INST]{{prompt}}[/INST]
{{assistant_message}}
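
If your runtime does not apply the chat template for you, the template above can be filled in by hand. This is a minimal sketch: the helper name `format_mistral_prompt` is hypothetical, and it assumes the line breaks shown in the template are for display only, so the segments are joined with no separator. It also assumes any beginning-of-sequence token is added by the runtime rather than by this function.

```python
def format_mistral_prompt(prompt, system_message=None):
    """Build a single-turn prompt string in the Mistral Instruct (New) layout.

    The model's reply ({{assistant_message}}) is generated after [/INST],
    so it is not part of the returned string.
    """
    parts = []
    if system_message:
        # The system block is optional and comes first
        parts.append(f"[SYSTEM_PROMPT]{system_message}[/SYSTEM_PROMPT]")
    parts.append(f"[INST]{prompt}[/INST]")
    # Assumption: no whitespace between segments
    return "".join(parts)


print(format_mistral_prompt("What is 12 * 13?", "Think step by step."))
```

Most GGUF runtimes can read the chat template embedded in the file's metadata, so manual formatting is usually only needed for raw completion endpoints.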

Download & run with cnvrs on iPhone, iPad, and Mac!

cnvrs.ai

cnvrs is the best app for private, local AI on your device.

Original Model Evaluation

The evaluation code is available at Hugging Face Open-R1. Note that I have updated the AIME 25 dataset to the full set, available at AIME 2025.

Our results below are averaged over multiple runs. See our eval details here.

| Model (Pass@1) | # Params | MATH-500 | AIME 2025 | AIME 2024 | GPQA Diamond |
|---|---|---|---|---|---|
| Mistral-24B-Reasoning (Ours) | 24B | 95.0 | 53.33 | 66.67 | 62.02 |
| Mistral-24B-Instruct | 24B | 70.6 | - | - | 45.3 |
| s1.1-32B | 32B | 93.2 | 40.0 | 56.7 | 61.62 |
| LIMO | 32B | 94.8 | 36.67 | 57.1 | 59.09 |
| DeepSeek-R1-Distill-Llama-70B | 70B | 94.5 | 46.67 | 70.0 | 65.2 |
| DeepSeek-R1-Distill-Qwen-32B | 32B | 94.3 | 60.0 | 72.6 | 62.1 |
| DeepSeek-R1 | 671B | 97.3 | 70.0 | 72.6 | 71.5 |
| o1 | - | 96.4 | 79.0 | - | 75.7 |
| o3-mini (high) | - | 97.9 | 86.5 | - | 77.2 |
| o3-mini (medium) | - | 97.3 | 76.5 | - | 74.9 |
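
For readers unfamiliar with the metric, Pass@1 averaged over multiple runs can be sketched as follows. This is an illustration only, not the original evaluation code: the function name `pass_at_1` and the sample data are hypothetical, and it assumes each run scores every problem as simply correct or incorrect.

```python
def pass_at_1(runs):
    """Average Pass@1 (in percent) over several evaluation runs.

    `runs` is a list of runs; each run is a list of booleans,
    one per problem, marking whether the single sampled answer was correct.
    """
    per_run = [100.0 * sum(run) / len(run) for run in runs]
    return sum(per_run) / len(per_run)


# Hypothetical data: two runs over the same four problems
example_runs = [
    [True, False, True, True],
    [True, True, False, False],
]
print(pass_at_1(example_runs))
```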