Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
Backyard AI
DiffusionBee
Draw Things
Invoke
Jan
Jellybox
llama.cpp
LM Studio
LocalAI
MLX LM
Msty
node-llama-cpp
Ollama
RecurseChat
Sanctum
TGI
vLLM
Apps with no match
JoyFusion
Inference Providers
Select all
Featherless AI
Inference Providers with no match
Novita
Fireworks
Nebius AI
Together AI
Cerebras
Hyperbolic
Nscale
SambaNova
fal
Groq
Replicate
Cohere
HF Inference API
Misc
Reset Misc
4-bit precision
text-generation-inference
Inference Endpoints
custom_code
8-bit precision
Merge
Eval Results
Mixture of Experts
text-embeddings-inference
Carbon Emissions
Apply filters
Models
36,200
Full-text search
Edit filters
Sort: Trending
Active filters:
4-bit
Clear all
TheBloke/LLaMA-Pro-8B-GPTQ
Text Generation
•
1B
•
Updated
Jan 6, 2024
•
142
•
2
isaiahbjork/tinyllama-function-calling-v0.1-merge
Text Generation
•
0.6B
•
Updated
Jan 18, 2024
•
19
Danielbrdz/Barcenas-Mixtral-8x7b-4bit
Text Generation
•
24B
•
Updated
Jan 6, 2024
•
13
TheBloke/Pallas-0.5-frankenmerge-GPTQ
Text Generation
•
5B
•
Updated
Jan 6, 2024
•
24
•
1
TheBloke/Pallas-0.5-frankenmerge-AWQ
Text Generation
•
6B
•
Updated
Jan 6, 2024
•
13
•
1
TheBloke/bagel-8x7b-v0.2-GPTQ
Text Generation
•
6B
•
Updated
Jan 6, 2024
•
48
•
3
TheBloke/bagel-8x7b-v0.2-AWQ
Text Generation
•
6B
•
Updated
Jan 6, 2024
•
41
•
1
TheBloke/zephyr-quiklang-3b-4K-GPTQ
Text Generation
•
0.6B
•
Updated
Jan 6, 2024
•
18
•
2
TheBloke/Sensualize-Solar-10.7B-GPTQ
Text Generation
•
2B
•
Updated
Jan 6, 2024
•
37
•
8
TheBloke/Sensualize-Solar-10.7B-AWQ
Text Generation
•
2B
•
Updated
Jan 6, 2024
•
17
•
2
TheBloke/sonya-medium-x8-MoE-GPTQ
Text Generation
•
9B
•
Updated
Jan 7, 2024
•
17
•
3
danielhanchen/test2
Text Generation
•
4B
•
Updated
Jan 7, 2024
•
15
danielhanchen/test3
Text Generation
•
4B
•
Updated
Jan 7, 2024
•
20
Delosint/firsttestmodel
Updated
Jan 7, 2024
•
1
TheBloke/LLaMA-Pro-8B-Instruct-GPTQ
Text Generation
•
1B
•
Updated
Jan 7, 2024
•
1.84k
•
5
TheBloke/LLaMA-Pro-8B-Instruct-AWQ
Text Generation
•
1B
•
Updated
Jan 7, 2024
•
1.82k
•
1
TheBloke/Noromaid-13B-v0.3-AWQ
Text Generation
•
2B
•
Updated
Jan 7, 2024
•
13
•
2
TheBloke/Noromaid-13B-v0.3-GPTQ
Text Generation
•
2B
•
Updated
Jan 7, 2024
•
18
•
9
Muhammadreza/Nucleus-1B-GPTQ
Text Generation
•
0.4B
•
Updated
Jan 7, 2024
•
15
•
1
KnutJaegersberg/MoMo-72B-4bit
Text Generation
•
38B
•
Updated
Jan 13, 2024
•
13
danielhanchen/test4
Text Generation
•
7B
•
Updated
Jan 7, 2024
•
15
Anarchist/myLora
Text Generation
•
4B
•
Updated
Jan 7, 2024
•
15
janphilippfranken/Mistral-7B-v0.1-awq
Text Generation
•
Updated
Jan 8, 2024
•
16
Crystalcareai/WagoAWQ
Text Generation
•
1B
•
Updated
Jan 8, 2024
•
22
xinyuanL/awq-AOS-Mistral
Text Generation
•
1B
•
Updated
Jan 8, 2024
•
43
yentinglin/Taiwan-LLM-13B-v2.0-chat-awq
Text Generation
•
2B
•
Updated
Apr 20
•
45
•
4
isaiahbjork/tinyllama-function-calling-v0.2-merge
Text Generation
•
0.6B
•
Updated
Jan 18, 2024
•
14
TheBloke/Mixtral_34Bx2_MoE_60B-GPTQ
Text Generation
•
8B
•
Updated
Jan 8, 2024
•
39
•
7
TheBloke/Mixtral_34Bx2_MoE_60B-AWQ
Text Generation
•
9B
•
Updated
Jan 8, 2024
•
12
•
4
Technoculture/MT7Bi-alpha
Updated
Jan 17, 2024
•
39
Previous
1
...
98
99
100
Next