-
-
-
-
-
-
Inference Providers
Active filters:
gptq
Qwen/Qwen-72B-Chat-Int8
Text Generation
•
Updated
•
42
•
17
AI-Sweden-Models/gpt-sw3-6.7b-v2-instruct-4bit-gptq
Text Generation
•
Updated
•
168
•
6
Pi3141/alpaca-7b-native-enhanced-GPTQ
Text Generation
•
Updated
•
2
AI-Sweden-Models/gpt-sw3-20b-instruct-4bit-gptq
Text Generation
•
Updated
•
128
•
4
TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ
Text Generation
•
Updated
•
368k
•
136
TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
Text Generation
•
Updated
•
15k
•
51
TheBloke/dolphin-2.5-mixtral-8x7b-GPTQ
Text Generation
•
Updated
•
200
•
111
TheBloke/GEITje-7B-chat-GPTQ
Text Generation
•
Updated
•
41
•
4
TheBloke/laser-dolphin-mixtral-2x7b-dpo-GPTQ
Text Generation
•
Updated
•
58
•
11
TheBloke/DareVox-7B-GPTQ
Text Generation
•
Updated
•
20
•
2
TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ
Updated
•
626
•
57
Qwen/Qwen1.5-72B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
66
•
7
Qwen/Qwen1.5-72B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
2.05k
•
37
Qwen/Qwen1.5-14B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
166
•
21
Qwen/Qwen1.5-7B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
98
•
26
Qwen/Qwen1.5-7B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
218
•
18
Qwen/Qwen1.5-4B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
67
•
5
Qwen/Qwen1.5-4B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
126
•
5
Qwen/Qwen1.5-1.8B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
34
•
2
Qwen/Qwen1.5-1.8B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
71
•
7
Qwen/Qwen1.5-0.5B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
54
•
13
Qwen/Qwen1.5-0.5B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
158
•
4
Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
345
•
44
Qwen/Qwen1.5-32B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
1.87k
•
30
explodinggradients/Ragas-critic-llm-Qwen1.5-GPTQ
Text Generation
•
Updated
•
12
astronomer/Llama-3-8B-Instruct-GPTQ-4-Bit
Text Generation
•
Updated
•
6.17k
•
25
Qwen/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
31
•
17
IntelLabs/sqft-phi-3-mini-4k-50-base-gptq
Text Generation
•
Updated
•
474
•
2
neuralmagic/Mistral-7B-Instruct-v0.3-GPTQ-4bit
Text Generation
•
Updated
•
1.31k
•
18
allganize/Llama-3-Alpha-Ko-8B-Instruct-marlin
Text Generation
•
Updated
•
30
•
5