NVIDIA L40S GPU's for MXFP4 quantization
#100
by
lordim
- opened
Are NVIDIA L40S GPU's also compatible for MXFP4 quantization? I'm trying to load gpt-oss-20b on this machine, but it seems to default to bf16.
Did it work with bf16?