Active filters: vllm
DBMe/Mistral-Large-Instruct-2411-2.86bpw-h6-exl2 • Updated • 6 • 1
gallantpigeon/mistral-large-instruct-2411-w8a16 • 31B • Updated • 6
gallantpigeon/mistral-large-instruct-2411-int8-w8a8 • 123B • Updated • 3
tensorblock/Mistral-Small-Instruct-2409-GGUF
bartowski/Sparse-Llama-3.1-8B-2of4-GGUF • Text Generation • 8B • Updated • 147 • 3
QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF • Text Generation • 8B • Updated • 30 • 4
parasail-ai/GritLM-7B-vllm • Text Generation • 7B • Updated • 8.23k • 1
QuantFactory/L3-Aspire-Heart-Matrix-8B-GGUF • Text Generation • 8B • Updated • 52 • 2
tensorblock/Sparse-Llama-3.1-8B-2of4-GGUF • Text Generation • 8B • Updated • 619
dangvansam/gemma-2-27b-it-FP8-fix-system-role • Text Generation • 27B • Updated • 25
dangvansam/gemma-2-2b-it-fix-system-role • Text Generation • 3B • Updated • 3
dangvansam/gemma-2-9b-it-fix-system-role • Text Generation • 9B • Updated • 83 • 1
yejingfu/nmagic-Meta-Llama-3-8B-Instruct-FP8 • 8B • Updated • 2
gghfez/Mistral-Large-Instruct-2411 • 123B • Updated • 6
jacobcarajo/Ministral-8B-Instruct-2410-Q5_K_M-GGUF • 8B • Updated • 1 • 1
vitekkor/T-pro-it-1.0-bnb-8bit • 33B • Updated • 3 • 1
itlwas/Ministral-8B-Instruct-2410-Q4_K_M-GGUF • 8B • Updated • 7
redhat6/Ministral-8B-Instruct-2410-Q8_0-GGUF • 8B • Updated • 2 • 1
itlwas/Mistral-Small-Instruct-2409-Q4_K_M-GGUF • 22B • Updated • 15
nintwentydo/pixtral-12b-FP8-dynamic-FP8-KV-cache • Image-Text-to-Text • 13B • Updated • 2 • 1
matrixportalx/Ministral-8B-Instruct-2410-Q4_0-GGUF
adriabama06/SmallThinker-3B-Preview-AWQ • Text Generation • Updated • 2 • 1
matrixportalx/Ministral-8B-Instruct-2410-Q4_K_M-GGUF • 8B • Updated • 7 • 1
matrixportalx/Ministral-8B-Instruct-2410-Q4_K_S-GGUF
RedHatAI/Mixtral-8x22B-v0.1-quantized.w4a16
RedHatAI/Mixtral-8x7B-v0.1-quantized.w4a16
RedHatAI/QwQ-32B-Preview-FP8-dynamic • Text Generation • 33B • Updated • 7
RedHatAI/QwQ-32B-Preview-quantized.w4a16
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16 • Text Generation • 11B • Updated • 30
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8 • Text Generation • 71B • Updated • 5