Quantized using the default exllamav3 (0.0.4) quantization process.
- Original model: unsloth/GLM-Z1-32B-0414 (refer to it for more details on the model).
- exllamav3: https://github.com/turboderp-org/exllamav3
EXL3 quants available:
- 5.0bpw
- To choose a quant, go to "Files and versions", then click on "Main" and select the one you want.
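
If you prefer to fetch a quant from a script rather than the web page, the following is a minimal sketch using `snapshot_download` from the `huggingface_hub` package. It assumes each quant lives on its own branch and that the branch is named after the quant label (for example `5.0bpw`); check the branch list for the real names. The repository id shown in the usage comment is a hypothetical placeholder, and `download_args` and `download_quant` are illustrative helper names, not part of any library.

```python
def download_args(repo_id: str, branch: str, local_dir: str) -> dict:
    """Build the keyword arguments passed to huggingface_hub.snapshot_download."""
    return {
        "repo_id": repo_id,      # e.g. "<namespace>/<this-repo>" (placeholder)
        "revision": branch,      # assumed: branch name matches the quant label
        "local_dir": local_dir,  # where the files are written
    }


def download_quant(repo_id: str, branch: str, local_dir: str) -> str:
    """Download one quant branch and return the local folder path."""
    # Imported lazily so download_args can be used without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(**download_args(repo_id, branch, local_dir))


# Usage (hypothetical repository id; requires network access):
# path = download_quant("<namespace>/<this-repo>", "5.0bpw", "./exl3-5.0bpw")
# print(path)
```

The `revision` argument accepts a branch name, tag, or commit hash, so the same call works if the quants are published under a different naming scheme.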