arxiv:2501.16496
Neel Nanda
NeelNanda
AI & ML interests
Mechanistic Interpretability
Recent Activity
authored
a paper
about 14 hours ago
Open Problems in Mechanistic Interpretability
authored
a paper
2 months ago
Do I Know This Entity? Knowledge Awareness and Hallucinations in
Language Models
updated
a model
3 months ago
NeelNanda/crosscoders-gpt2-small
Organizations
Papers
11
models
65
NeelNanda/crosscoders-gpt2-small
Updated
•
5
NeelNanda/GELU_1L512W_C4_Code
Updated
•
727
•
2
NeelNanda/gpt-neox-tokenizer-digits
Updated
•
2
NeelNanda/sparse_autoencoder
Updated
•
3
NeelNanda/redwood-attn-only-2l
Updated
•
1
NeelNanda/Othello-GPT-Transformer-Lens
Updated
NeelNanda/full_pred_log_probs
Updated
NeelNanda/SoLU_1L256W_C4_Width_Scan
Updated
•
3
NeelNanda/SoLU_1L128W_C4_Width_Scan
Updated
•
3
NeelNanda/SoLU_1L64W_C4_Width_Scan
Updated
•
2
datasets
15
NeelNanda/pile-small-tokenized-2b
Viewer
•
Updated
•
10.8M
•
1.09k
NeelNanda/pile-tokenized-10b
Viewer
•
Updated
•
10.8M
•
156
•
1
NeelNanda/openwebtext-tokenized-9b
Viewer
•
Updated
•
8.83M
•
278
NeelNanda/code-10k
Viewer
•
Updated
•
10k
•
68
•
1
NeelNanda/wiki-10k
Viewer
•
Updated
•
10k
•
59
NeelNanda/c4-code-20k
Viewer
•
Updated
•
20k
•
145
•
4
NeelNanda/c4-10k
Viewer
•
Updated
•
10k
•
438
NeelNanda/c4-tokenized-2b
Viewer
•
Updated
•
1.36M
•
196
NeelNanda/code-tokenized
Viewer
•
Updated
•
297k
•
61
NeelNanda/c4-code-tokenized-2b
Viewer
•
Updated
•
1.66M
•
70
•
1