Qwen3-30B-A3B-NVFP4 / recipe.yaml
llmat: Add NVFP4 quantized model for Qwen/Qwen3-30B-A3B. (commit dbde053, verified)
default_stage:
  default_modifiers:
    QuantizationModifier:
      targets: [Linear]
      ignore: [lm_head]
      scheme: NVFP4
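
As a rough reading of the recipe: `targets: [Linear]` selects linear layers, `ignore: [lm_head]` excludes the output head, and `scheme: NVFP4` names the quantization preset. The Python sketch below models that selection rule in a deliberately simplified way. The helper `is_quantized` and the module names are hypothetical, chosen only for illustration. The actual matching is performed by the quantization tooling, which may support more than the exact-name comparison assumed here.

```python
# Illustrative only: a simplified model of how `targets` and `ignore` pick layers.
# Assumption: entries are compared by exact string equality. The real tooling
# may accept other forms of entry that this sketch does not handle.

RECIPE = {
    "targets": ["Linear"],  # module class names to quantize
    "ignore": ["lm_head"],  # module names to leave unquantized
    "scheme": "NVFP4",      # preset name, carried along but not interpreted here
}


def is_quantized(module_name: str, module_class: str, recipe: dict = RECIPE) -> bool:
    """Return True if a module is selected under this simplified rule."""
    if module_name in recipe["ignore"]:
        return False  # explicitly excluded, even if its class is targeted
    return module_class in recipe["targets"]


# Hypothetical (name, class) pairs, not read from the actual checkpoint.
modules = [
    ("model.layers.0.self_attn.q_proj", "Linear"),
    ("model.layers.0.input_layernorm", "RMSNorm"),
    ("lm_head", "Linear"),
]

selected = [name for name, cls in modules if is_quantized(name, cls)]
print(selected)
```

Under these assumptions only the attention projection is selected: the norm layer is not a `Linear`, and `lm_head` is excluded by name even though its class is targeted.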