	---
	license: apache-2.0
	base_model:
	- prithivMLmods/Magpie-Qwen-CortexDual-0.6B
	library_name: transformers
	language:
	- en
	pipeline_tag: text-generation
	tags:
	- text-generation-inference
	- math
	- code
	---
	# Magpie-Qwen-CortexDual-0.6B-GGUF

	> Magpie-Qwen-CortexDual-0.6B is a general-purpose model tuned for math, code, and structured reasoning. Its CortexDual thinking mode adapts to the complexity of a problem, switching automatically into stepwise reasoning for intricate logic or math tasks. This 0.6B-parameter model uses 80% of the Magpie Pro 330k dataset plus a modular blend of other datasets for general-purpose proficiency and domain versatility.
	## ModelFile

	| File Name | Size | Source |
	|----------------------------------|-----------|--------|
	| Magpie-Qwen-0.6B.BF16.gguf | 1.2 GB | xet |
	| Magpie-Qwen-0.6B.F16.gguf | 1.2 GB | xet |
	| Magpie-Qwen-0.6B.F32.gguf | 2.39 GB | xet |
	| Magpie-Qwen-0.6B.Q4_K_M.gguf | 397 MB | xet |
	| Magpie-Qwen-0.6B.Q5_K_M.gguf | 444 MB | xet |
	| Magpie-Qwen-0.6B.Q8_0.gguf | 639 MB | xet |
	| .gitattributes | 1.97 kB | - |
	| README.md | 723 Bytes | - |
	| config.json | 31 Bytes | - |
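
As a rough aid for choosing among the files above, the sketch below picks the largest listed GGUF that fits a download-size budget and builds a direct link to it. The repo id is an assumption (this README does not state it), the helper names `pick_quant` and `resolve_url` are hypothetical, and the URL layout simply mirrors the `resolve/main` links used elsewhere in this README. File size is only a proxy for memory use at runtime.

```python
from typing import Optional

# File sizes in MB, copied from the ModelFile table (1 GB taken as 1000 MB).
QUANT_FILES = {
    "Magpie-Qwen-0.6B.Q4_K_M.gguf": 397,
    "Magpie-Qwen-0.6B.Q5_K_M.gguf": 444,
    "Magpie-Qwen-0.6B.Q8_0.gguf": 639,
    "Magpie-Qwen-0.6B.BF16.gguf": 1200,
    "Magpie-Qwen-0.6B.F16.gguf": 1200,
    "Magpie-Qwen-0.6B.F32.gguf": 2390,
}

# Assumption: the repo id is not given in this README; adjust it as needed.
REPO_ID = "prithivMLmods/Magpie-Qwen-CortexDual-0.6B-GGUF"


def pick_quant(budget_mb: float) -> Optional[str]:
    """Return the largest listed file that fits within budget_mb, or None."""
    fitting = [(size, name) for name, size in QUANT_FILES.items() if size <= budget_mb]
    if not fitting:
        return None
    # max() compares by size first; equal sizes fall back to the file name.
    return max(fitting)[1]


def resolve_url(filename: str, repo_id: str = REPO_ID) -> str:
    """Build a direct-download URL in the resolve/main layout."""
    return f"https://huggingface.co/{repo_id}/resolve/main/{filename}"


if __name__ == "__main__":
    choice = pick_quant(500)
    if choice is not None:
        print(choice, resolve_url(choice))
```

With a 500 MB budget this selects the Q5_K_M file; a budget below 397 MB returns `None`, since no listed file fits.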

	## Quants Usage

	(Sorted by size, not necessarily by quality. IQ-quants are often preferable to similarly sized non-IQ quants.)

	| Link | Type | Size/GB | Notes |
	|:-----|:-----|--------:|:------|
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.Q2_K.gguf) | Q2_K | 0.4 | |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.Q3_K_S.gguf) | Q3_K_S | 0.5 | |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.Q3_K_M.gguf) | Q3_K_M | 0.5 | lower quality |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.Q3_K_L.gguf) | Q3_K_L | 0.5 | |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.IQ4_XS.gguf) | IQ4_XS | 0.6 | |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.Q4_K_S.gguf) | Q4_K_S | 0.6 | fast, recommended |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.Q4_K_M.gguf) | Q4_K_M | 0.6 | fast, recommended |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.Q5_K_S.gguf) | Q5_K_S | 0.6 | |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.Q5_K_M.gguf) | Q5_K_M | 0.7 | |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.Q6_K.gguf) | Q6_K | 0.7 | very good quality |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.Q8_0.gguf) | Q8_0 | 0.9 | fast, best quality |
	| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-GGUF/resolve/main/Qwen3-0.6B.f16.gguf) | f16 | 1.6 | 16 bpw, overkill |
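
The "16 bpw" note on the f16 row gives a reference point for a back-of-the-envelope estimate of bits per weight (bpw) for the other rows: scale 16 by the ratio of file sizes. The sketch below is only that rough arithmetic, with a hypothetical helper name `approx_bpw`. It assumes file size is proportional to bpw, ignores GGUF metadata, and inherits the one-decimal rounding of the table, so the results are coarse and tend to overstate the nominal figure for each quant type.

```python
# Sizes in GB, copied from the Quants Usage table (rounded to one decimal there).
SIZES_GB = {
    "Q2_K": 0.4,
    "Q4_K_M": 0.6,
    "Q6_K": 0.7,
    "Q8_0": 0.9,
    "f16": 1.6,
}


def approx_bpw(quant: str, ref: str = "f16", ref_bpw: float = 16.0) -> float:
    """Estimate bits per weight by scaling the reference bpw by the size ratio."""
    return ref_bpw * SIZES_GB[quant] / SIZES_GB[ref]


if __name__ == "__main__":
    for name in SIZES_GB:
        print(f"{name}: ~{approx_bpw(name):.1f} bpw")
```

By this estimate the Q8_0 row comes out at roughly 9 bpw and Q2_K at roughly 4 bpw, which is enough to compare rows against each other but not to replace measured values.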

	Here is a handy graph by ikawrakow comparing some lower-quality quant
	types (lower is better):

	![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
