Edit Models filters

Apps

Apps with no match

Inference Providers

Inference Providers with no match

HF Inference API

Misc

4-bit precision

8-bit precision

Inference Endpoints

text-generation-inference

Mixture of Experts

Carbon Emissions

text-embeddings-inference

Models

2,368

Full-text search

Active filters: quantized

RedHatAI/Qwen3-32B-quantized.w4a16

Text Generation • 6B • Updated May 13 • 3.9k • 7

TheMelonGod/Qwen3-8B-exl2

Text Generation • Updated May 22 • 34 • 1

nvidia/DeepSeek-R1-0528-FP4

Text Generation • Updated 19 days ago • 8.85k • 25

humbleakh/qwen2.5-vl-3b-8bit-chain-of-zoom

Image-to-Text • Updated 20 days ago • 72 • 1

PinkPixel/Crystal-Think-V2-GGUF

Text Generation • 4B • Updated 2 days ago • 16 • 1

PinkPixel/Crystal-Think-V2-Imatrix-GGUF

Text Generation • 4B • Updated 2 days ago • 18 • 1

muranAI/DeepSeek-R1-0528-Qwen3-8B-GGUF

8B • Updated 2 days ago • 151 • 1

onnx-community/distiluse-base-multilingual-v2-merged-onnx

Feature Extraction • Updated 1 day ago • 1

muranAI/gemma-3n-E4B-it-GGUF

Text Generation • 7B • Updated about 17 hours ago • 1

ravenscroftj/CodeGen-350M-multi-ggml-quant

Text Generation • Updated Apr 24, 2023 • 2

ravenscroftj/CodeGen-2B-multi-ggml-quant

Text Generation • Updated Aug 5, 2023 • 2

ravenscroftj/CodeGen-6B-multi-ggml-quant

Text Generation • Updated Apr 24, 2023 • 9

ethzanalytics/dolly-v2-12b-sharded-8bit

Text Generation • Updated Apr 29, 2023 • 20 • 4

ethzanalytics/dolly-v2-7b-sharded-8bit

Text Generation • Updated Jun 28, 2023 • 26 • 1

pszemraj/long-t5-tglobal-xl-16384-book-summary-8bit

Summarization • 3B • Updated Jan 21 • 42

ethzanalytics/stablelm-tuned-alpha-7b-sharded-8bit

Text Generation • Updated May 4, 2023 • 124 • 2

rozek/OpenLLaMA_7B_300BT_q4

Text Generation • Updated May 5, 2023 • 1

ethzanalytics/stablelm-tuned-alpha-3b-gptq-4bit-128g

Text Generation • Updated May 7, 2023 • 25

kyo-takano/open-calm-7b-8bit

Text Generation • Updated May 28, 2023 • 57 • 10

CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA

Text Generation • Updated Jul 20, 2023 • 13

CONCISE/LLaMa_V2-13B-Chat-Uncensored-GGML

Text Generation • Updated Aug 7, 2023 • 14 • 7

CONCISE/LLaMa_V2-13B-Instruct-Uncensored-GGML

Text Generation • Updated Aug 17, 2023 • 28 • 5

rozek/LLaMA-2-7B-32K_GGUF

Text Generation • 7B • Updated Aug 31, 2023 • 438 • 9

rozek/LLaMA-2-7B-32K-Instruct_GGUF

Text Generation • 7B • Updated Aug 31, 2023 • 153 • 4

RedHatAI/bge-small-en-v1.5-quant

Feature Extraction • Updated Nov 13, 2023 • 257 • 9

RedHatAI/bge-base-en-v1.5-quant

Feature Extraction • Updated Nov 13, 2023 • 292 • 4

RedHatAI/bge-large-en-v1.5-quant

Feature Extraction • Updated Nov 13, 2023 • 21.1k • 22

afrideva/TinyLlama-1.1B-intermediate-step-715k-1.5T-GGUF

1B • Updated Nov 4, 2023 • 145

afrideva/tinyllama-colorist-v2-GGUF

Text Generation • 1B • Updated Nov 4, 2023 • 131

afrideva/stablelm-3b-4e1t-GGUF

Text Generation • 3B • Updated Nov 5, 2023 • 335 • 1