Active filters: vllm
RedHatAI/Llama-3.2-90B-Vision-Instruct-FP8-dynamic • Text Generation • 89B • 2.76k • 10
soprasteria/Mixtral-8x7B-Instruct-v0.1-FP8 • 47B • 2
RedHatAI/Phi-3.5-mini-instruct-FP8-KV • Text Generation • 4B • 5 • 2
RedHatAI/Qwen2.5-0.5B-quantized.w8a16 • Text Generation • 0.4B • 7
RedHatAI/Qwen2.5-1.5B-quantized.w8a16 • Text Generation • 0.8B • 8
RedHatAI/Qwen2.5-3B-quantized.w8a16 • Text Generation • 1B • 10
RedHatAI/Qwen2.5-7B-quantized.w8a16 • Text Generation • 3B • 14 • 1
RedHatAI/Qwen2.5-32B-quantized.w8a16 • Text Generation • 9B • 6
RedHatAI/Qwen2.5-72B-quantized.w8a16 • Text Generation • 20B • 5
RedHatAI/pixtral-12b-FP8-dynamic • Text Generation • 13B • 8.55k • 10
mlx-community/Ministral-8B-Instruct-2410-bf16 • 8B • 23 • 2
mlx-community/Ministral-8B-Instruct-2410-4bit • 1B • 189 • 9
mlx-community/Ministral-8B-Instruct-2410-8bit • 2B • 20 • 2
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic • Text Generation • 71B • 1.05k • 14
TouchNight/Ministral-8B-Instruct-2410-HF • 8B • 13
TouchNight/Ministral-8B-Instruct-2410-HF-Q5_K_M-GGUF • 8B • 2
ijohn07/Ministral-8B-Instruct-2410-HF-Q8_0-GGUF • 8B • 8
adriabama06/reader-lm-1.5b-AWQ • Text Generation • 0.4B • 4 • 1
sasha0552/Ministral-8B-Instruct-2410
aashish1904/Ministral-8B-Instruct-2410-HF-Q4_K_M-GGUF • 8B • 12 • 1
QuantFactory/TouchNight-Ministral-8B-Instruct-2410-HF-GGUF • 8B • 23 • 2
aashish1904/Ministral-8B-Instruct-2410-HF-Q2_K-GGUF • 8B • 5 • 2
GrimsenClory/Ministral-8B-Instruct-2410-Q6_K-GGUF • 8B • 10
QuantFactory/Ministral-8B-Instruct-2410-GGUF • 8B • 111 • 2
gphorvath/Ministral-8B-Instruct-2410-Q4_K_M-GGUF • 8B • 4
Gleisson1/Ministral-8B-Instruct-2410-HF-4bit • 5B • 4
paultimothymooney/Ministral-8B-Instruct-2410-Q8_0-GGUF • 8B • 2
paultimothymooney/Ministral-8B-Instruct-2410-Q4_K_M-GGUF • 8B • 3
LouiSeHU/Mistral-Small-Instruct-2409-Q4_0-GGUF • 22B • 2
yejingfu/nmagic-Meta-Llama-3.1-8B-Instruct-FP8 • Text Generation • 8B • 5.59k