Llamacpp Quantizations of Qwen3-235B-A22B-Thinking-2507

Original model: Qwen/Qwen3-235B-A22B-Thinking-2507.

All quants were made using bartowski1182's llama.cpp.

All quants use the BF16 conversion from unsloth/Qwen3-235B-A22B-Thinking-2507-GGUF/BF16.
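
In this pipeline the BF16 GGUF is the input to llama.cpp's llama-quantize tool, which writes each quant type listed below. The Python wrapper below is only a minimal sketch of that step: the binary path, file names, and thread count are assumptions, not the exact invocation used for this repo.

```python
import subprocess
from pathlib import Path

# Hypothetical local paths -- adjust to your own llama.cpp build and download location.
LLAMA_QUANTIZE = Path("llama.cpp/build/bin/llama-quantize")
BF16_GGUF = Path("Qwen3-235B-A22B-Thinking-2507-BF16.gguf")

def quantize(quant_type: str, nthreads: int = 16) -> Path:
    """Run llama-quantize: BF16 GGUF in, quantized GGUF (e.g. Q2_K, Q4_K_M) out."""
    out = BF16_GGUF.with_name(BF16_GGUF.stem.replace("BF16", quant_type) + ".gguf")
    subprocess.run(
        [str(LLAMA_QUANTIZE), str(BF16_GGUF), str(out), quant_type, str(nthreads)],
        check=True,
    )
    return out

for qtype in ("Q2_K", "Q4_K_M"):
    print("wrote", quantize(qtype))
```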

Q2_K : 77.60 GiB (2.84 BPW)

Q4_K_M : 133.27 GiB (4.87 BPW)
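
The bits-per-weight figures are consistent with the listed file sizes and the model's nominal 235B total parameters; a quick arithmetic check (only the sizes above and the 235B count are used):

```python
# Cross-check bits-per-weight: file size in bits divided by the nominal 235B parameter count.
N_PARAMS = 235e9
GIB = 1024 ** 3

for name, size_gib in {"Q2_K": 77.60, "Q4_K_M": 133.27}.items():
    bpw = size_gib * GIB * 8 / N_PARAMS
    print(f"{name}: {bpw:.2f} BPW")  # ~2.84 and ~4.87, matching the listing above
```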
