Lynx-TinySync-0.6B-GGUF

Lynx-TinySync-0.6B is a lightweight, high-performance model designed for mathematical reasoning, code generation, and general-purpose inference. Built on a custom modular dataset and powered by an efficient architecture, it excels in delivering structured, accurate outputs even in mid-resource environments. Despite its compact 0.6B parameter size, it demonstrates remarkable proficiency in math, code, and technical language understanding.

Model File

File Name Size Format
Lynx-TinySync-0.6B.BF16.gguf 1.2 GB BF16
Lynx-TinySync-0.6B.F16.gguf 1.2 GB F16
Lynx-TinySync-0.6B.F32.gguf 2.39 GB F32
Lynx-TinySync-0.6B.Q4_K_M.gguf 397 MB Q4_K_M
Lynx-TinySync-0.6B.Q5_K_M.gguf 444 MB Q5_K_M
Lynx-TinySync-0.6B.Q8_0.gguf 639 MB Q8_0
.gitattributes 1.98 kB -
README.md 220 B -
config.json 31 B JSON

Quants Usage

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

image.png

Downloads last month
135
GGUF
Model size
596M params
Architecture
qwen3
Hardware compatibility
Log In to view the estimation

4-bit

5-bit

8-bit

16-bit

32-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for prithivMLmods/Lynx-TinySync-0.6B-GGUF

Finetuned
Qwen/Qwen3-0.6B
Quantized
(8)
this model

Collection including prithivMLmods/Lynx-TinySync-0.6B-GGUF