Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

compressed-tensors

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

2,999

Full-text search

Active filters: compressed-tensors

AP314159/DeepSeek-V2-Lite-Chat-FP8

Text Generation • 16B • Updated 16 days ago • 22

Ba2han/augment5-4b-fp8-GPTQ

Image-Text-to-Text • 4B • Updated 16 days ago • 33

AIDXteam/Qwen3-235B-A22B-Instruct-2507-NVFP4A16

133B • Updated 16 days ago • 35

braginpawel/zft-v4-34b-kto-1-W8A8

34B • Updated 16 days ago • 20

gesong2077/Qwen3-Coder-30B-A3B-Instruct-NVFP4

17B • Updated 16 days ago • 21

cpatonn/granite-4.0-h-tiny-AWQ-4bit

Text Generation • 2B • Updated 15 days ago • 455

cpatonn/granite-4.0-h-tiny-AWQ-8bit

Text Generation • 3B • Updated 15 days ago • 81

cpatonn/granite-4.0-h-small-AWQ-4bit

Text Generation • 10B • Updated 15 days ago • 277

cpatonn/granite-4.0-h-small-AWQ-8bit

Text Generation • 13B • Updated 15 days ago • 43

kylesayrs/Llama-3.2-1B-Instruct-attention-fp8-head

1B • Updated 15 days ago • 14

nm-testing/Llama-3.2-1B-Instruct-attention-fp8-head

1B • Updated 13 days ago • 10

kylesayrs/Llama-3.2-1B-Instruct-attention-nvfp4-head

1B • Updated 15 days ago • 13

Kark07/Qwen3-4B-Instruct-llmcomp-awq-4bit

1B • Updated 15 days ago • 56

ronantakizawa/SmolVLM-Instruct-gptq

Image-Text-to-Text • 0.8B • Updated 10 days ago • 64 • 1

ronantakizawa/SmolVLM-Instruct-awq

Image-Text-to-Text • 0.8B • Updated 10 days ago • 66 • 1

philkuz/llama-3.3-70b-instruct-fp8

Text Generation • 71B • Updated 14 days ago • 458 • 1

RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4

133B • Updated 14 days ago • 139

cpatonn/Qwen3-VL-8B-Instruct-AWQ-4bit

Image-Text-to-Text • 3B • Updated 14 days ago • 4.6k • 1

cpatonn/Qwen3-VL-8B-Instruct-AWQ-8bit

Image-Text-to-Text • 4B • Updated 14 days ago • 1.03k • 1

cpatonn/Qwen3-VL-8B-Thinking-AWQ-4bit

Image-Text-to-Text • 3B • Updated 14 days ago • 1.44k • 1

cpatonn/Qwen3-VL-8B-Thinking-AWQ-8bit

Image-Text-to-Text • 4B • Updated 14 days ago • 634 • 1

cpatonn/Qwen3-VL-4B-Instruct-AWQ-8bit

Image-Text-to-Text • 2B • Updated 14 days ago • 241 • 1

cpatonn/Qwen3-VL-4B-Thinking-AWQ-4bit

Image-Text-to-Text • 2B • Updated 14 days ago • 522

cpatonn/Qwen3-VL-4B-Thinking-AWQ-8bit

Image-Text-to-Text • 2B • Updated 14 days ago • 91

Kark07/Qwen3-4B-Instruct-2507-llmcomp-simple-W4A8

4B • Updated 14 days ago • 28

Kark07/Qwen3-4B-Instruct-2507-llmcomp-simple-W4A16

4B • Updated 14 days ago • 21

Kark07/Qwen3-4B-Instruct-2507-llmcomp-simple-W8A8

4B • Updated 14 days ago • 28

ConicCat/Qwen2.5-72B-Instruct-abliterated-FP8-Dynamic

73B • Updated 13 days ago • 19

zlyngkhoi/Qwen3-4B-Instruct-2507-GPTQ-W8A8

4B • Updated 13 days ago • 44

zlyngkhoi/Qwen3-4B-Instruct-2507-llmcomp-gptq-4bit-ifeval

1B • Updated 13 days ago • 19