Oganesson-TinyLlama-1.2B is a lightweight and efficient language model built on the Llama 3.2 1B architecture (about 1.2B parameters). Fine-tuned for general-purpose inference, mathematical reasoning, and code generation, it is suited to edge devices, personal assistants, and educational applications that need a compact yet capable model.
File Name | Size | Format |
---|---|---|
Oganesson-TinyLlama-1.2B.BF16.gguf | 2.48 GB | BF16 |
Oganesson-TinyLlama-1.2B.F16.gguf | 2.48 GB | F16 |
Oganesson-TinyLlama-1.2B.F32.gguf | 4.95 GB | F32 |
Oganesson-TinyLlama-1.2B.Q4_K_M.gguf | 808 MB | Q4_K_M |
.gitattributes | 1.8 kB | - |
README.md | 212 B | - |
config.json | 31 B | JSON |
(Listed by file name, not by size or quality. IQ-quants are often preferable to similar-sized non-IQ quants.)
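The file sizes in the table give a rough sense of how many bits each format spends per weight. The sketch below is only an estimate under stated assumptions: sizes are read as decimal units (1 GB = 10^9 bytes), GGUF metadata overhead is ignored, and the parameter count is inferred from the F32 file at 4 bytes per weight. The helper names `estimate_param_count` and `bits_per_weight` are illustrative and not part of any library.

```python
# File sizes copied from the table above, read as decimal units (assumption).
SIZES_BYTES = {
    "BF16": 2.48e9,
    "F16": 2.48e9,
    "F32": 4.95e9,
    "Q4_K_M": 808e6,
}


def estimate_param_count(f32_bytes: float) -> float:
    """Infer the weight count from the F32 file: 4 bytes per weight.

    Ignores GGUF header and metadata size (assumption), so this is approximate.
    """
    return f32_bytes / 4


def bits_per_weight(file_bytes: float, n_params: float) -> float:
    """Average number of bits stored per weight for a file of the given size."""
    return file_bytes * 8 / n_params


n_params = estimate_param_count(SIZES_BYTES["F32"])
print(f"estimated parameters: {n_params / 1e9:.2f}B")
for name, size in SIZES_BYTES.items():
    print(f"{name:7s} {bits_per_weight(size, n_params):5.2f} bits/weight")
```

Under these assumptions the Q4_K_M file works out to a little over 5 bits per weight rather than exactly 4, which is consistent with Q4_K_M being a mixed scheme that keeps some tensors at higher precision and stores per-block scales.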
Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):
Base model: meta-llama/Llama-3.2-1B