Text Generation
Safetensors
qwen3
text-generation-inference
text2text-generation
conversational
8-bit precision
compressed-tensors
Qwen3-32B-INT8 / recipe.yaml
initial commit
bdaae7c verified
172 Bytes
default_stage:
  default_modifiers:
    SmoothQuantModifier: {smoothing_strength: 0.8}
    GPTQModifier:
      targets: [Linear]
      ignore: [lm_head]
      scheme: W8A8
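
The recipe sets `smoothing_strength: 0.8` and the `W8A8` scheme (8-bit weights, 8-bit activations). As a rough illustration of what those two settings mean numerically, the following pure-Python sketch computes SmoothQuant-style per-channel scales, s_j = max|X_j|^alpha / max|W_j|^(1 - alpha), and applies plain symmetric INT8 rounding. It assumes that `smoothing_strength` plays the role of alpha in that formula. The function names `smoothquant_scales` and `quantize_int8` are hypothetical, and the round-to-nearest step is a stand-in: GPTQ itself uses error-compensating rounding, which is not shown here.

```python
def smoothquant_scales(act_absmax, weight_absmax, alpha=0.8, eps=1e-5):
    """Per-input-channel smoothing scales (hypothetical helper).

    s_j = max|X_j|^alpha / max|W_j|^(1 - alpha).
    Activations are divided by s_j and the matching weight columns are
    multiplied by s_j, so the layer output is unchanged while activation
    outliers shrink. alpha=0.8 mirrors smoothing_strength in the recipe
    (assumed correspondence).
    """
    return [
        max(a, eps) ** alpha / max(w, eps) ** (1.0 - alpha)
        for a, w in zip(act_absmax, weight_absmax)
    ]


def quantize_int8(values):
    """Symmetric per-tensor INT8 quantization with round-to-nearest.

    Returns (integer codes in [-127, 127], scale). This is NOT GPTQ; it only
    shows the 8-bit grid that a W8A8 scheme maps values onto.
    """
    absmax = max(abs(v) for v in values) or 1.0  # avoid a zero scale
    scale = absmax / 127.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


# Channel 0 has an activation outlier (16.0); channel 1 does not.
scales = smoothquant_scales(act_absmax=[16.0, 1.0], weight_absmax=[1.0, 1.0])
smoothed_acts = [16.0 / scales[0], 1.0 / scales[1]]

codes, scale = quantize_int8([127.0, -64.0, 0.0])
```

With alpha = 0.8, most of the outlier's magnitude moves from the activations into the weights, which is why the smoothed activation range for channel 0 ends up much closer to channel 1.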