| Model | Task | Size | Updated | Downloads | Likes |
|:--|:--|:--|:--|--:|--:|
| hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4 | Text Generation | 2B | Aug 7, 2024 | 13.8k | 39 |
| DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters | | | Jul 27 | 136 | |
| DavidAU/L3.1-Dark-Reasoning-LewdPlay-evo-Hermes-R1-Uncensored-8B | Text Generation | 8B | Jul 28 | 180 | 24 |
| hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4 | Text Generation | 59B | Sep 13, 2024 | 781 | 36 |
| hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 | Text Generation | 2B | Aug 7, 2024 | 97.8k | 76 |
| hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4 | Text Generation | 11B | Aug 7, 2024 | 324k | 105 |
| hugging-quants/Meta-Llama-3.1-405B-Instruct-GPTQ-INT4 | Text Generation | 59B | Aug 7, 2024 | 1.62k | 16 |
| hugging-quants/Meta-Llama-3.1-405B-Instruct-BNB-NF4 | Text Generation | 214B | Sep 16, 2024 | 12 | 5 |
| hugging-quants/Meta-Llama-3.1-8B-Instruct-BNB-NF4 | Text Generation | 5B | Aug 8, 2024 | 955 | 8 |
| ModelCloud/Meta-Llama-3.1-8B-Instruct-gptq-4bit | Text Generation | 2B | Jul 29, 2024 | 605 | 4 |
| ModelCloud/Meta-Llama-3.1-70B-Instruct-gptq-4bit | Text Generation | 11B | Jul 27, 2024 | 20 | 4 |
| hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 | Text Generation | 11B | Aug 7, 2024 | 3k | 23 |
| sunnyyy/openbuddy-llama3.1-8b-v22.1-131k-Q4_K_M-GGUF | Text Generation | 8B | Jul 25, 2024 | 10 | |
| azhiboedova/Meta-Llama-3.1-8B-Instruct-AQLM-2Bit-1x16 | Text Generation | 2B | Aug 28, 2024 | 2 | 13 |
| hugging-quants/Meta-Llama-3.1-405B-BNB-NF4-BF16 | Text Generation | 111B | Sep 16, 2024 | 16 | 2 |