-
-
-
-
-
-
Inference Providers
Active filters:
awq
mratsim/MiniMax-M2.1-FP8-INT4-AWQ
Text Generation
•
39B
•
Updated
•
151
•
5
QuantTrio/MiniMax-M2.1-AWQ
Text Generation
•
229B
•
Updated
•
2.09k
•
8
Text Generation
•
358B
•
Updated
•
15.3k
•
15
stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ
Text Generation
•
8B
•
Updated
•
5.57k
•
4
casperhansen/deepseek-r1-distill-qwen-14b-awq
15B
•
Updated
•
8.2k
•
14
Text Generation
•
33B
•
Updated
•
76k
•
119
Text Generation
•
8B
•
Updated
•
102k
•
32
twhitworth/gpt-oss-120b-awq-w4a16
117B
•
Updated
•
2.91k
•
18
QuantTrio/Seed-OSS-36B-Instruct-AWQ
Text Generation
•
36B
•
Updated
•
356
•
7
Text Generation
•
229B
•
Updated
•
337k
•
9
geonmin-kim/Qwen3-MoE-1.2B-A0.6B-AWQ
1B
•
Updated
•
9
•
2
QuantTrio/DeepSeek-V3.2-AWQ
Text Generation
•
685B
•
Updated
•
3.71k
•
9
cybermotaz/nemotron3-nano-nvfp4-w4a16
Text Generation
•
18B
•
Updated
•
7.16k
•
7
CultriX/Nevoria-R1-70b-AWQ-W4A16-g128
Text Generation
•
11B
•
Updated
•
154
•
1
TheHouseOfTheDude/GLM-4.7_Compressed-Tensors
Text Generation
•
Updated
•
10
•
4
casperhansen/mpt-7b-8k-chat-awq
Text Generation
•
Updated
•
26
•
3
casperhansen/falcon-7b-awq
Text Generation
•
Updated
•
24
•
1
casperhansen/vicuna-7b-v1.5-awq
Text Generation
•
Updated
•
21
•
3
casperhansen/vicuna-7b-v1.5-awq-gemv
Text Generation
•
Updated
•
19
•
1
casperhansen/mpt-7b-8k-chat-awq-gemv
Text Generation
•
Updated
•
17
casperhansen/opt-125m-awq
Text Generation
•
0.2B
•
Updated
•
113
•
3
casperhansen/tinyllama-1b-awq
Text Generation
•
Updated
•
28
Bomml/Llama-2-70B-chat-w4-g128-awq
Text Generation
•
Updated
TheBloke/Llama-2-7B-Chat-AWQ
Text Generation
•
7B
•
Updated
•
1.84k
•
24
Text Generation
•
7B
•
Updated
•
877
•
17
Text Generation
•
13B
•
Updated
•
105
•
14
TheBloke/CodeLlama-13B-Python-AWQ
Text Generation
•
13B
•
Updated
•
23
•
2
TheBloke/CodeLlama-13B-Instruct-AWQ
Text Generation
•
13B
•
Updated
•
936
•
9
TheBloke/CodeLlama-13B-AWQ
Text Generation
•
13B
•
Updated
•
76
•
4
TheBloke/Llama-2-13B-chat-AWQ
Text Generation
•
13B
•
Updated
•
335
•
26