Refusal Token Models
Collection
This collection contains models described in the refusal token paper published in COLM 2025.
•
5 items
•
Updated
This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B on UltraChat SFT.
For this model, model.generate
or pipeline
are sufficient.