Commit 130088a by felfri (verified) · 1 Parent(s): 744c054

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -8,9 +8,9 @@ base_model:
 
  # SCAR
 
- Official weights for the Paper [**Scar: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs**](https://arxiv.org/abs/2411.07122).
+ Official code and weights for the Paper [**Scar: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs**](https://arxiv.org/abs/2411.07122). The code is located in this [Repository](https://github.com/ml-research/SCAR).
 
- The code is located in this [Repository](https://github.com/ml-research/SCAR).
+ This repo contains the code to apply supervised SAEs on LLMs. With this, LLMs can be equipped with strong detection and steering abilities. In this repo, we showcase SCAR on the example of toxicity but any other concept can be applied equally well.
 
  # Usage
 