Use unsloth FP16 GGUF to quantize IQ1_M. AMD Ryzen 7 Pro 8845HS + 128 GB DDR5 5600 RAM in LM Studio with complete GPU offloading yields around 5 token/s.

Downloads last month
160
GGUF
Model size
235B params
Architecture
qwen3moe
Hardware compatibility
Log In to view the estimation

1-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for lovedheart/Qwen3-253B-A22B-IQ1_M

Quantized
(36)
this model