Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Efficient-ML
's Collections
Qwen3-Quantization
LLaMA3-Quantization
Qwen3-Quantization
updated
May 12
This is the official quantized models collection of Qwen3 Quantization
Upvote
6
Efficient-ML/Qwen3-0.6B-base-gptq-w4-128
Updated
May 5
Efficient-ML/Qwen3-0.6B-base-gptq-w8-128
Updated
May 5
Efficient-ML/Qwen3-0.6B-base-gptq-w8-perchannel
Updated
May 5
Efficient-ML/Qwen3-0.6B-base-gptq-w4-perchannel
Updated
May 5
Efficient-ML/Qwen3-1.7B-base-gptq-w4-128
Updated
May 5
Efficient-ML/Qwen3-1.7B-base-gptq-w4-perchannel
Updated
May 5
Efficient-ML/Qwen3-1.7B-base-gptq-w8-128
Updated
May 5
Efficient-ML/Qwen3-1.7B-base-gptq-w8-perchannel
Updated
May 5
Efficient-ML/Qwen3-4B-base-gptq-w4-128
Updated
May 5
Efficient-ML/Qwen3-4B-base-gptq-w8-128
Updated
May 5
Efficient-ML/Qwen3-4B-base-gptq-w8-perchannel
Updated
May 5
Efficient-ML/Qwen3-4B-base-gptq-w4-perchannel
Updated
May 5
Efficient-ML/Qwen3-8B-base-gptq-w4-128
Updated
May 5
Efficient-ML/Qwen3-8B-base-gptq-w8-128
Updated
May 5
Efficient-ML/Qwen3-8B-base-gptq-w4-perchannel
Updated
May 5
Efficient-ML/Qwen3-8B-base-gptq-w8-perchannel
Updated
May 5
Efficient-ML/Qwen3-14B-base-gptq-w4-128
Updated
May 6
Efficient-ML/Qwen3-14B-base-gptq-w4-perchannel
Updated
May 6
Efficient-ML/Qwen3-14B-base-gptq-w8-128
Updated
May 7
Efficient-ML/Qwen3-14B-base-gptq-w8-perchannel
Updated
May 7
Efficient-ML/Qwen3-0.6B-gptq-w8-128
Updated
May 6
Efficient-ML/Qwen3-0.6B-gptq-w4-128
Updated
May 6
Efficient-ML/Qwen3-0.6B-gptq-w4-perchannel
Updated
May 6
Efficient-ML/Qwen3-0.6B-gptq-w8-perchannel
Updated
May 6
Efficient-ML/Qwen3-1.7B-gptq-w4-128
Updated
May 6
Efficient-ML/Qwen3-1.7B-gptq-w4-perchannel
Updated
May 6
Efficient-ML/Qwen3-1.7B-gptq-w8-128
Updated
May 6
Efficient-ML/Qwen3-1.7B-gptq-w8-perchannel
Updated
May 6
Efficient-ML/Qwen3-4B-gptq-w4-128
Updated
May 6
Efficient-ML/Qwen3-4B-gptq-w4-perchannel
Updated
May 6
Efficient-ML/Qwen3-4B-gptq-w8-128
Updated
May 7
Efficient-ML/Qwen3-4B-gptq-w8-perchannel
Updated
May 7
Efficient-ML/Qwen3-8B-gptq-w4-128
Updated
May 7
Efficient-ML/Qwen3-8B-gptq-w4-perchannel
Updated
May 7
Efficient-ML/Qwen3-8B-gptq-w8-128
Updated
May 7
Efficient-ML/Qwen3-8B-gptq-w8-perchannel
Updated
May 7
Efficient-ML/Qwen3-14B-gptq-w4-perchannel
Updated
May 7
Efficient-ML/Qwen3-14B-gptq-w4-128
Updated
May 7
•
1
Efficient-ML/Qwen3-14B-gptq-w8-128
Updated
May 7
Efficient-ML/Qwen3-14B-gptq-w8-perchannel
Updated
May 7
An Empirical Study of Qwen3 Quantization
Paper
•
2505.02214
•
Published
May 4
•
25
Efficient-ML/Qwen3-awq
Updated
May 7
Efficient-ML/GPTQ-for-Qwen3
Updated
May 12
Upvote
6
+2
Share collection
View history
Collection guide
Browse collections