This is the MXFP4 post-training quantization (PTQ) version of huizimao/gpt-oss-20b-uncensored-bf16.

Fine-tuning was done on the Amazon FalseReject train set (800 samples).

PTQ was done with NVIDIA ModelOpt.
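For readers unfamiliar with the target format, here is a minimal pure-Python sketch of MXFP4-style fake quantization: blocks of 32 values share one power-of-two scale, and each value is stored as a 4-bit E2M1 float. It is not ModelOpt's implementation or API. The function names are hypothetical, ties are broken toward the smaller magnitude rather than with round-to-nearest-even, and the range limit on the shared 8-bit exponent is ignored.

```python
import math

# Magnitudes representable by an FP4 E2M1 element (sign is stored separately).
FP4_E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
BLOCK_SIZE = 32   # number of elements that share one scale in MXFP4
E2M1_EMAX = 2     # exponent of the largest E2M1 value (6 = 1.5 * 2**2)


def quantize_block(block):
    """Fake-quantize one block; return (shared_exponent, dequantized_values)."""
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return 0, [0.0] * len(block)
    # Power-of-two scale that maps the largest magnitude near the top of the FP4 range.
    shared_exp = math.floor(math.log2(max_abs)) - E2M1_EMAX
    scale = 2.0 ** shared_exp
    out = []
    for x in block:
        # Scale down, clamp to the largest FP4 magnitude, snap to the nearest grid point.
        mag = min(abs(x) / scale, FP4_E2M1_VALUES[-1])
        nearest = min(FP4_E2M1_VALUES, key=lambda v: abs(v - mag))
        out.append(math.copysign(nearest * scale, x))
    return shared_exp, out


def quantize_mxfp4(values):
    """Hypothetical helper: fake-quantize a flat list block by block."""
    out = []
    for start in range(0, len(values), BLOCK_SIZE):
        _, dequantized = quantize_block(values[start:start + BLOCK_SIZE])
        out.extend(dequantized)
    return out


if __name__ == "__main__":
    exp, deq = quantize_block([6.0, 3.0, 1.0, 0.4])
    print(exp, deq)
```

Because every element in a block shares one scale, a single large outlier coarsens the grid for the other 31 values. That is one plausible source of the accuracy gap between a BF16 model and its MXFP4 counterpart.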

Evaluation results were obtained on the Amazon FalseReject test set (300 samples).

| Model variant | False refusal rate |
| --- | --- |
| gpt-oss-20b original (MXFP4) | 70% |
| LoRA (BF16) | 5% |
| LoRA + PTQ (MXFP4) - this model | 22% |
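As a rough illustration of the metric in the table above, the false refusal rate can be computed as the fraction of benign prompts that the model refuses. The sketch below assumes each response has already been judged as a refusal or not. How that judgment was made for this model is not stated here, and the function name and sample data are hypothetical.

```python
def false_refusal_rate(refused):
    """Return the fraction of benign prompts whose responses were judged refusals.

    `refused` holds one boolean per benign prompt (True means the model refused).
    """
    if not refused:
        raise ValueError("need at least one judged response")
    return sum(1 for r in refused if r) / len(refused)


if __name__ == "__main__":
    # Hypothetical judgments for 10 benign prompts, not actual evaluation data.
    judgments = [True, False, False, True, False, False, False, True, False, False]
    print(f"{false_refusal_rate(judgments):.0%}")
```

Applied to the 300-sample test set, each additional refused prompt moves the rate by about 0.33 percentage points.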

Code examples, documentation, and further QAT checkpoints will be released soon.
