Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Hugging Quants

AI & ML interests

Optimised quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗

hugging-quants 's collections 3

Gemma2 AWQ Quants

Optimised AWQ Quants for high-throughput deployments of Gemma2! Compatible with Transformers, TGI & VLLM 🤗

hugging-quants/gemma-2-9b-it-AWQ-INT4

Text Generation • 2B • Updated Oct 17, 2024 • 2.26k • 6

Llama 3.1 GPTQ, AWQ, and BNB Quants

Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗

hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4

Text Generation • 59B • Updated Sep 13, 2024 • 1.57k • 36
hugging-quants/Meta-Llama-3.1-405B-Instruct-BNB-NF4

Text Generation • 214B • Updated Sep 16, 2024 • 10 • 5
hugging-quants/Meta-Llama-3.1-405B-Instruct-GPTQ-INT4

Text Generation • 59B • Updated Aug 7, 2024 • 1.19k • 16
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4

Text Generation • 11B • Updated Aug 7, 2024 • 440k • 105

Llama 3.2 3B & 1B GGUF Quants

Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models.

hugging-quants/Llama-3.2-3B-Instruct-Q8_0-GGUF

Text Generation • 3B • Updated Sep 25, 2024 • 5.54k • 52
hugging-quants/Llama-3.2-3B-Instruct-Q4_K_M-GGUF

Text Generation • 3B • Updated Sep 25, 2024 • 8.93k • 20
hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF

Text Generation • 1B • Updated Sep 25, 2024 • 21.4k • 34
hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF

Text Generation • 1B • Updated Sep 25, 2024 • 45.8k • 17

Gemma2 AWQ Quants

Optimised AWQ Quants for high-throughput deployments of Gemma2! Compatible with Transformers, TGI & VLLM 🤗

hugging-quants/gemma-2-9b-it-AWQ-INT4

Text Generation • 2B • Updated Oct 17, 2024 • 2.26k • 6

Llama 3.2 3B & 1B GGUF Quants

Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models.

hugging-quants/Llama-3.2-3B-Instruct-Q8_0-GGUF

Text Generation • 3B • Updated Sep 25, 2024 • 5.54k • 52
hugging-quants/Llama-3.2-3B-Instruct-Q4_K_M-GGUF

Text Generation • 3B • Updated Sep 25, 2024 • 8.93k • 20
hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF

Text Generation • 1B • Updated Sep 25, 2024 • 21.4k • 34
hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF

Text Generation • 1B • Updated Sep 25, 2024 • 45.8k • 17

Llama 3.1 GPTQ, AWQ, and BNB Quants

Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗

hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4

Text Generation • 59B • Updated Sep 13, 2024 • 1.57k • 36
hugging-quants/Meta-Llama-3.1-405B-Instruct-BNB-NF4

Text Generation • 214B • Updated Sep 16, 2024 • 10 • 5
hugging-quants/Meta-Llama-3.1-405B-Instruct-GPTQ-INT4

Text Generation • 59B • Updated Aug 7, 2024 • 1.19k • 16
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4

Text Generation • 11B • Updated Aug 7, 2024 • 440k • 105

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs