altomek
/

Llama-3.1-Minitron-4B-Width-Base-Q4_0_4_4-GGUF

Model card Files Files and versions Community

Llama-3.1-Minitron-4B-Width-Base

ExLlamav2 8 bpw quant of https://huggingface.co/nvidia/Llama-3.1-Minitron-4B-Width-Base

Downloads last month: 6

GGUF

Model size

4.51B params

Architecture

llama

Hardware compatibility

Log In to view the estimation

4-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for altomek/Llama-3.1-Minitron-4B-Width-Base-Q4_0_4_4-GGUF

Base model

nvidia/Llama-3.1-Minitron-4B-Width-Base

Quantized

(17)

this model

Collection including altomek/Llama-3.1-Minitron-4B-Width-Base-Q4_0_4_4-GGUF

Quants for ARM

11 items • Updated Jan 4 • 1