-
-
-
-
-
-
Inference Providers
Active filters:
int8
FriendliAI/Meta-Llama-3.1-70B-Instruct-int8
Text Generation
•
Updated
•
8
neuralmagic/Qwen2.5-0.5B-quantized.w8a16
Text Generation
•
Updated
•
30
neuralmagic/Qwen2.5-1.5B-quantized.w8a16
Text Generation
•
Updated
•
20
neuralmagic/Qwen2.5-3B-quantized.w8a16
Text Generation
•
Updated
•
20
neuralmagic/Qwen2.5-32B-quantized.w8a16
Text Generation
•
Updated
•
146
neuralmagic/Qwen2.5-72B-quantized.w8a16
Text Generation
•
Updated
•
63
avans06/Meta-Llama-3.1-8B-Instruct-ct2-int8_float16
Text Generation
•
Updated
•
9
avans06/Meta-Llama-3.2-8B-Instruct-ct2-int8_float16
Text Generation
•
Updated
•
39
SteveTran/T5-small-query-expansion-INT8
Text2Text Generation
•
Updated
•
26
mradermacher/ecastera-eva-westlake-7b-spanish-GGUF
Updated
•
169
NeoChen1024/Dolphin3.0-Llama3.1-8B-W8A8
NeoChen1024/dolphin-2.9.3-mistral-7B-32k-W8A8
neuralmagic/granite-3.1-2b-instruct-quantized.w8a8
Text Generation
•
Updated
•
98
neuralmagic/granite-3.1-2b-base-quantized.w8a8
Text Generation
•
Updated
•
138
neuralmagic/granite-3.1-8b-base-quantized.w8a8
Text Generation
•
Updated
•
49
neuralmagic/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w8a8
Text Generation
•
Updated
•
366
•
1
neuralmagic/Pixtral-Large-Instruct-2411-hf-quantized.w8a8
Image-Text-to-Text
•
Updated
•
111
labaispeak/stable-diffusion-2-1-openvino-int8
Text-to-Image
•
Updated