Hugging Face
Models: 10
Sort: Trending
Active filters: vLLM
prithivMLmods/Galactic-Qwen-14B-Exp2 • Text Generation • Updated 19 days ago • 446 • 5
model-scope/glm-4-9b-chat-GPTQ-Int4 • Text Generation • Updated Jul 17, 2024 • 6
model-scope/glm-4-9b-chat-GPTQ-Int8 • Text Generation • Updated Jul 23, 2024 • 45 • 2
tclf90/qwen2.5-72b-instruct-gptq-int4 • Text Generation • Updated Nov 4, 2024 • 2
tclf90/qwen2.5-72b-instruct-gptq-int3 • Text Generation • Updated Nov 4, 2024
mradermacher/Galactic-Qwen-14B-Exp2-GGUF • Updated 18 days ago • 95 • 1
mradermacher/Galactic-Qwen-14B-Exp2-i1-GGUF • Updated 18 days ago • 237 • 1
prithivMLmods/Nu2-Lupi-Qwen-14B • Text Generation • Updated 20 days ago • 17 • 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF • Updated 18 days ago • 416 • 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF • Updated 18 days ago • 522 • 1