Models: 182

Active filters: autoawq

aari1995/germeo-7b-awq

Text Generation • 1B • Updated Apr 2, 2024 • 593 • 2

kaitchup/Yi-6B-awq-4bit

Text Generation • 1B • Updated Mar 21, 2024 • 22

kaitchup/Llama-3-8b-awq-4bit

Text Generation • 2B • Updated Apr 29, 2024 • 43

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k

Text Generation • 8B • Updated Jul 9, 2024 • 19

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-GGUF

Text Generation • 8B • Updated Jul 9, 2024 • 11

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-GPTQ

Text Generation • 2B • Updated Jul 9, 2024 • 14

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-AWQ

Text Generation • 2B • Updated Jul 9, 2024 • 14

hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4

Text Generation • 59B • Updated Sep 13, 2024 • 8.54k • 36

hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4

Text Generation • 2B • Updated Aug 7, 2024 • 280k • 68

hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4

Text Generation • 11B • Updated Aug 7, 2024 • 154k • 101

jburmeister/Meta-Llama-3.1-70B-Instruct-AWQ-INT4

Text Generation • 11B • Updated Jul 25, 2024 • 14

jburmeister/Meta-Llama-3.1-405B-Instruct-AWQ-INT4

Text Generation • 59B • Updated Jul 25, 2024 • 16

Kalei/Meta-Llama-3.1-70B-Instruct-AWQ-INT4-Custom

Text Generation • 11B • Updated Jul 31, 2024 • 32

UCLA-EMC/Meta-Llama-3.1-8B-AWQ-INT4

Text Generation • 2B • Updated Aug 30, 2024 • 36

UCLA-EMC/Meta-Llama-3.1-8B-Instruct-AWQ-INT4-32-2.17B

Text Generation • 2B • Updated Aug 30, 2024 • 17 • 1

reach-vb/Meta-Llama-3.1-8B-Instruct-AWQ-INT4-fix

Text Generation • 2B • Updated Aug 14, 2024 • 39

jburmeister/Meta-Llama-3.1-8B-Instruct-AWQ-INT4

Text Generation • 2B • Updated Aug 21, 2024 • 14

awilliamson/Meta-Llama-3.1-70B-Instruct-AWQ

Text Generation • 11B • Updated Sep 7, 2024 • 18

flowaicom/Flow-Judge-v0.1-AWQ

Text Generation • 0.7B • Updated Oct 9, 2024 • 105 • 6

hugging-quants/Mixtral-8x7B-Instruct-v0.1-AWQ-INT4

Text Generation • 6B • Updated Oct 7, 2024 • 11.3k

hugging-quants/gemma-2-9b-it-AWQ-INT4

Text Generation • 2B • Updated Oct 17, 2024 • 2.72k • 6

ibnzterrell/Nvidia-Llama-3.1-Nemotron-70B-Instruct-HF-AWQ-INT4

Text Generation • 11B • Updated Dec 7, 2024 • 1.77k • 5

NeuML/Llama-3.1_OpenScholar-8B-AWQ

Text Generation • 2B • Updated Nov 27, 2024 • 80 • 3

fbaldassarri/TinyLlama_TinyLlama_v1.1-autoawq-int4-gs128-asym

Text Generation • 0.3B • Updated Nov 27, 2024 • 17

fbaldassarri/TinyLlama_TinyLlama_v1.1-autoawq-int4-gs128-sym

Text Generation • 0.3B • Updated Nov 27, 2024 • 17

fbaldassarri/EleutherAI_pythia-14m-autoawq-int4-gs128-asym

Text Generation • 0.0B • Updated Nov 27, 2024 • 13

fbaldassarri/EleutherAI_pythia-14m-autoawq-int4-gs128-sym

Text Generation • 0.0B • Updated Nov 27, 2024 • 14

fbaldassarri/EleutherAI_pythia-31m-autoawq-int4-gs128-asym

Text Generation • 0.0B • Updated Nov 27, 2024 • 13

fbaldassarri/EleutherAI_pythia-31m-autoawq-int4-gs128-sym

Text Generation • 0.0B • Updated Nov 27, 2024 • 31

fbaldassarri/EleutherAI_pythia-70m-deduped-autoawq-int4-gs128-asym

Text Generation • 0.1B • Updated Nov 27, 2024 • 76