This is the MXFP4 post-training quantization (PTQ) version of huizimao/gpt-oss-20b-uncensored-bf16.
Fine-tuning was done on the Amazon FalseReject train set (800 samples).
PTQ was done with NVIDIA ModelOpt.
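For readers unfamiliar with the MXFP4 format, the sketch below shows roughly what quantizing one block of weights to it involves: 32 values share a single power-of-two scale, and each value is rounded to the nearest FP4 (E2M1) number. This is an illustration of the number format only, not ModelOpt's implementation. The function names are hypothetical, and the scale rule (largest magnitude's exponent minus the E2M1 maximum exponent) follows the OCP Microscaling convention, which this model's pipeline is assumed, not confirmed, to follow.

```python
import math

# Magnitudes representable by FP4 E2M1 (sign is stored separately).
FP4_E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
BLOCK_SIZE = 32   # MXFP4 shares one scale across 32 elements
E2M1_EMAX = 2     # largest E2M1 exponent: 6.0 = 1.5 * 2**2


def quantize_block(block):
    """Return (shared_exp, quantized element values) for one block.

    Hypothetical helper for illustration; assumes the OCP-style scale rule.
    """
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return 0, [0.0] * len(block)
    # Power-of-two scale chosen so the largest value lands near the top of the FP4 range.
    shared_exp = math.floor(math.log2(max_abs)) - E2M1_EMAX
    scale = 2.0 ** shared_exp
    quantized = []
    for x in block:
        magnitude = min(abs(x) / scale, FP4_E2M1_VALUES[-1])  # clamp to 6.0
        nearest = min(FP4_E2M1_VALUES, key=lambda v: abs(v - magnitude))
        quantized.append(math.copysign(nearest, x))
    return shared_exp, quantized


def dequantize_block(shared_exp, quantized):
    """Map quantized elements back to real values using the shared scale."""
    return [v * 2.0 ** shared_exp for v in quantized]


# Toy block: mostly small weights plus one larger value that sets the scale.
block = [0.3] * (BLOCK_SIZE - 1) + [1.0]
exp, q = quantize_block(block)
restored = dequantize_block(exp, q)
print(exp, restored[0], restored[-1])
```

The rounding error visible on the small weights in this toy block is the kind of loss that PTQ introduces relative to BF16.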
Evaluation results were obtained on the Amazon FalseReject test set (300 samples).
| Model variant | False refusal rate |
|---|---|
| gpt-oss-20b original (MXFP4) | 70% |
| LoRA (BF16) | 5% |
| LoRA + PTQ (MXFP4) - this model | 22% |
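As a rough guide to how the metric in the table can be read, the sketch below computes a false refusal rate as the fraction of benign prompts that a model refused. The function name and the labels are hypothetical, and how a response is judged to be a refusal is not specified here, so the per-prompt labels are simply assumed to be given.

```python
def false_refusal_rate(refused):
    """Fraction of benign prompts the model refused.

    `refused` is a list of booleans, one per evaluated prompt
    (True means the model refused). Hypothetical helper for illustration.
    """
    if not refused:
        raise ValueError("need at least one evaluated prompt")
    return sum(refused) / len(refused)


# Made-up labels for ten benign prompts, not actual evaluation data.
labels = [True, False, False, False, True, False, False, False, False, False]
print(f"{false_refusal_rate(labels):.0%}")
```

Lower is better: every refusal counted here is a refusal of a prompt that should have been answered.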
Code examples, documentation, and further quantization-aware training (QAT) checkpoints will be released soon.