Models: 7,204

Active filters: awq

mratsim/MiniMax-M2.1-FP8-INT4-AWQ

Text Generation • 39B • Updated about 15 hours ago • 151 • 5

QuantTrio/MiniMax-M2.1-AWQ

Text Generation • 229B • Updated 7 days ago • 2.09k • 8

QuantTrio/GLM-4.7-AWQ

Text Generation • 358B • Updated 8 days ago • 15.3k • 15

stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ

Text Generation • 8B • Updated Jun 4, 2025 • 5.57k • 4

casperhansen/deepseek-r1-distill-qwen-14b-awq

15B • Updated Feb 8, 2025 • 8.2k • 14

Qwen/Qwen3-32B-AWQ

Text Generation • 33B • Updated May 21, 2025 • 76k • 119

Qwen/Qwen3-8B-AWQ

Text Generation • 8B • Updated May 21, 2025 • 102k • 32

twhitworth/gpt-oss-120b-awq-w4a16

117B • Updated Aug 19, 2025 • 2.91k • 18

QuantTrio/Seed-OSS-36B-Instruct-AWQ

Text Generation • 36B • Updated Sep 15, 2025 • 356 • 7

QuantTrio/MiniMax-M2-AWQ

Text Generation • 229B • Updated Dec 3, 2025 • 337k • 9

geonmin-kim/Qwen3-MoE-1.2B-A0.6B-AWQ

1B • Updated Nov 21, 2025 • 9 • 2

QuantTrio/DeepSeek-V3.2-AWQ

Text Generation • 685B • Updated Dec 3, 2025 • 3.71k • 9

cybermotaz/nemotron3-nano-nvfp4-w4a16

Text Generation • 18B • Updated 19 days ago • 7.16k • 7

CultriX/Nevoria-R1-70b-AWQ-W4A16-g128

Text Generation • 11B • Updated 2 days ago • 154 • 1

TheHouseOfTheDude/GLM-4.7_Compressed-Tensors

Text Generation • Updated 10 days ago • 10 • 4

casperhansen/mpt-7b-8k-chat-awq

Text Generation • Updated Nov 4, 2023 • 26 • 3

casperhansen/falcon-7b-awq

Text Generation • Updated Nov 4, 2023 • 24 • 1

casperhansen/vicuna-7b-v1.5-awq

Text Generation • Updated Oct 31, 2023 • 21 • 3

casperhansen/vicuna-7b-v1.5-awq-gemv

Text Generation • Updated Oct 31, 2023 • 19 • 1

casperhansen/mpt-7b-8k-chat-awq-gemv

Text Generation • Updated Oct 31, 2023 • 17

casperhansen/opt-125m-awq

Text Generation • 0.2B • Updated Oct 31, 2023 • 113 • 3

casperhansen/tinyllama-1b-awq

Text Generation • Updated Oct 31, 2023 • 28

Bomml/Llama-2-70B-chat-w4-g128-awq

Text Generation • Updated Sep 16, 2023

TheBloke/Llama-2-7B-Chat-AWQ

Text Generation • 7B • Updated Nov 9, 2023 • 1.84k • 24

TheBloke/Llama-2-7B-AWQ

Text Generation • 7B • Updated Nov 9, 2023 • 877 • 17

TheBloke/Llama-2-13B-AWQ

Text Generation • 13B • Updated Nov 9, 2023 • 105 • 14

TheBloke/CodeLlama-13B-Python-AWQ

Text Generation • 13B • Updated Nov 9, 2023 • 23 • 2

TheBloke/CodeLlama-13B-Instruct-AWQ

Text Generation • 13B • Updated Nov 9, 2023 • 936 • 9

TheBloke/CodeLlama-13B-AWQ

Text Generation • 13B • Updated Nov 9, 2023 • 76 • 4

TheBloke/Llama-2-13B-chat-AWQ

Text Generation • 13B • Updated Nov 9, 2023 • 335 • 26