qwen3-4b-code-reasoning-f32-GGUF

Qwen 3 Code Reasoning is a compact 4-billion-parameter model fine-tuned on nvidia/OpenCodeReasoning, designed specifically for coding and logical reasoning tasks. It excels at code generation and logical question answering. To get the best results, it is recommended to use a context window greater than 10,000 tokens, with a temperature of 0.7, top-p of 60, top-k of 0.99, and a minimum p of 0.05. Running the model in "thinking mode" is advised for more coherent and insightful outputs. While powerful, the model may occasionally overthink simple prompts and can run into context overflow with very long inputs.

Model Files

File Name	Quant Type	Size
qwen3-4b-code-reasoning.BF16.gguf	BF16	8.05 GB
qwen3-4b-code-reasoning.F16.gguf	F16	8.05 GB
qwen3-4b-code-reasoning.F32.gguf	F32	16.1 GB
qwen3-4b-code-reasoning.Q2_K.gguf	Q2_K	1.67 GB
qwen3-4b-code-reasoning.Q3_K_L.gguf	Q3_K_L	2.24 GB
qwen3-4b-code-reasoning.Q3_K_M.gguf	Q3_K_M	2.08 GB
qwen3-4b-code-reasoning.Q3_K_S.gguf	Q3_K_S	1.89 GB
qwen3-4b-code-reasoning.Q4_K_M.gguf	Q4_K_M	2.5 GB
qwen3-4b-code-reasoning.Q4_K_S.gguf	Q4_K_S	2.38 GB
qwen3-4b-code-reasoning.Q5_K_M.gguf	Q5_K_M	2.89 GB
qwen3-4b-code-reasoning.Q5_K_S.gguf	Q5_K_S	2.82 GB
qwen3-4b-code-reasoning.Q6_K.gguf	Q6_K	3.31 GB
qwen3-4b-code-reasoning.Q8_0.gguf	Q8_0	4.28 GB

Quants Usage

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

Downloads last month: 219

GGUF

Model size

4.02B params

Architecture

qwen3

Hardware compatibility

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

32-bit

Model tree for prithivMLmods/qwen3-4b-code-reasoning-f32-GGUF

Base model

Qwen/Qwen3-4B-Base

Finetuned

Qwen/Qwen3-4B

Finetuned

ertghiu256/qwen3-4b-code-reasoning

Quantized

(1)

this model