Single SAEs trained on the residual stream activation vectors from every transformer layer simultaneously: https://arxiv.org/abs/2409.04185
Tim Lawson
tim-lawson
·
AI & ML interests
Mechanistic interpretability, language modelling, semantics
Recent Activity
updated
a model
about 1 month ago
tim-lawson/temp-2e8a2069
updated
a model
about 1 month ago
tim-lawson/temp-390cf2d5
published
a model
about 1 month ago
tim-lawson/temp-2e8a2069
Organizations
None yet
Collections
6
Papers
1
models
291

tim-lawson/temp-2e8a2069
Updated
•
5

tim-lawson/temp-390cf2d5
Updated
•
5

tim-lawson/1c81f4e8-a72d-4bbf-bbf7-39f3f0880c29
Updated
•
1

tim-lawson/62c023f5-fe83-487b-8b95-1c8bacd505bf
Updated
•
1

tim-lawson/01f98b06-e240-4115-b2da-ba68f32e8bac
Updated
•
1

tim-lawson/mlsae-gemma-2-2b-x64-k32
Updated
•
1

tim-lawson/mlsae-gemma-2-2b-x64-k32-tfm
Updated

tim-lawson/mlsae-Llama-3.2-3B-x64-k32
Updated

tim-lawson/mlsae-Llama-3.2-3B-x64-k32-tfm
Updated

tim-lawson/temp-pythia-70m-deduped-x64-k32-l3
Updated
datasets
61
tim-lawson/mlsae-pythia-1.4b-deduped-x64-k32-dists
Viewer
•
Updated
•
131k
•
34
tim-lawson/mlsae-Llama-3.2-3B-x64-k32-dists
Viewer
•
Updated
•
197k
•
18
tim-lawson/mlsae-gemma-2-2b-x64-k32-dists
Viewer
•
Updated
•
147k
•
69
tim-lawson/mlsae-gpt2-x64-k32-dists
Preview
•
Updated
•
5
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-11-dists
Viewer
•
Updated
•
49.2k
•
62
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-10-dists
Viewer
•
Updated
•
49.2k
•
10
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-8-dists
Viewer
•
Updated
•
49.2k
•
13
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-9-dists
Viewer
•
Updated
•
49.2k
•
19
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-7-dists
Viewer
•
Updated
•
49.2k
•
66
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-5-dists
Viewer
•
Updated
•
49.2k
•
11