# Oganesson-TinyLlama-1.2B-GGUF

Oganesson-TinyLlama-1.2B is a lightweight and efficient language model built on the LLaMA 3.2 1.2B architecture. Fine-tuned for general-purpose inference, mathematical reasoning, and code generation, it’s ideal for edge devices, personal assistants, and educational applications requiring a compact yet capable model.

## Model Files

| File Name | Size | Format |
|---|---|---|
| Oganesson-TinyLlama-1.2B.BF16.gguf | 2.48 GB | BF16 |
| Oganesson-TinyLlama-1.2B.F16.gguf | 2.48 GB | F16 |
| Oganesson-TinyLlama-1.2B.F32.gguf | 4.95 GB | F32 |
| Oganesson-TinyLlama-1.2B.Q4_K_M.gguf | 808 MB | Q4_K_M |
| .gitattributes | 1.8 kB | - |
| README.md | 212 B | - |
| config.json | 31 B | JSON |
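As a rough guide for choosing among the files above, here is a minimal, hypothetical helper (not part of this repo) that picks the highest-precision file fitting a given memory budget. Sizes are taken from the table; the overhead margin for KV cache and runtime is an assumption you should tune for your setup.

```python
# Hypothetical helper: choose a GGUF file from this repo by memory budget.
# File sizes (GB) come from the table above.
QUANT_SIZES_GB = {
    "Oganesson-TinyLlama-1.2B.Q4_K_M.gguf": 0.808,
    "Oganesson-TinyLlama-1.2B.BF16.gguf": 2.48,
    "Oganesson-TinyLlama-1.2B.F16.gguf": 2.48,
    "Oganesson-TinyLlama-1.2B.F32.gguf": 4.95,
}

def pick_quant(budget_gb: float, overhead_gb: float = 0.5):
    """Return the largest (highest-precision) file that fits the budget,
    leaving `overhead_gb` of headroom, or None if nothing fits."""
    candidates = [
        (size, name)
        for name, size in QUANT_SIZES_GB.items()
        if size + overhead_gb <= budget_gb
    ]
    if not candidates:
        return None
    return max(candidates)[1]

print(pick_quant(1.5))  # only the Q4_K_M file fits in 1.5 GB
print(pick_quant(8.0))  # the full-precision F32 file fits in 8 GB
```

The 0.5 GB default overhead is deliberately conservative; longer contexts need more headroom for the KV cache.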

## Quants Usage

(Sorted by size, not necessarily by quality; IQ-quants are often preferable to similarly sized non-IQ quants.)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

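To run one of these quants locally, a common route is llama.cpp's `llama-cli` binary. The sketch below assembles an illustrative one-shot command; the generation settings are assumptions, not defaults shipped with this repo, and the `subprocess` call is left commented out since it requires llama.cpp to be installed.

```python
# Sketch: build a llama-cli command line for the Q4_K_M quant.
# Flags: -m model path, -p prompt, -n tokens to generate, --temp temperature.
import subprocess

def build_llama_cli_cmd(model_path: str, prompt: str, n_predict: int = 128):
    """Assemble a llama-cli invocation for a one-shot completion."""
    return [
        "llama-cli",
        "-m", model_path,       # path to the .gguf file
        "-p", prompt,           # prompt text
        "-n", str(n_predict),   # number of tokens to generate
        "--temp", "0.7",        # sampling temperature (illustrative)
    ]

cmd = build_llama_cli_cmd(
    "Oganesson-TinyLlama-1.2B.Q4_K_M.gguf",
    "Explain GGUF in one sentence.",
)
# subprocess.run(cmd, check=True)  # uncomment once llama.cpp is installed
```

The same file also loads directly in llama.cpp bindings such as llama-cpp-python by passing the `.gguf` path as the model path.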

## Model Details

- Downloads last month: 67
- Format: GGUF
- Model size: 1.24B params
- Architecture: llama


## Model Tree

prithivMLmods/Oganesson-TinyLlama-1.2B-GGUF: quantized (this model)
