Model Immunization from a Condition Number Perspective
Abstract
An algorithm with regularization terms based on the condition number of a Hessian matrix is proposed to analyze and achieve model immunization, demonstrating effectiveness on both linear models and deep-nets.
Model immunization aims to pre-train models that are difficult to fine-tune on harmful tasks while retaining their utility on other non-harmful tasks. Though prior work has shown empirical evidence for immunizing text-to-image models, the key understanding of when immunization is possible and a precise definition of an immunized model remain unclear. In this work, we propose a framework, based on the condition number of a Hessian matrix, to analyze model immunization for linear models. Building on this framework, we design an algorithm with regularization terms to control the resulting condition numbers after pre-training. Empirical results on linear models and non-linear deep-nets demonstrate the effectiveness of the proposed algorithm on model immunization. The code is available at https://github.com/amberyzheng/model-immunization-cond-num.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- How to Enhance Downstream Adversarial Robustness (almost) without Touching the Pre-Trained Foundation Model? (2025)
- Targeted Forgetting of Image Subgroups in CLIP Models (2025)
- Neural Parameter Search for Slimmer Fine-Tuned Models and Better Transfer (2025)
- Update Your Transformer to the Latest Release: Re-Basin of Task Vectors (2025)
- Leaner Transformers: More Heads, Less Depth (2025)
- Enhancing Pre-Trained Model-Based Class-Incremental Learning through Neural Collapse (2025)
- Residual Feature Integration is Sufficient to Prevent Negative Transfer (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper