Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
Mor Geva
mega
Follow
iislucas's profile picture
gsarti's profile picture
2 followers
ยท
1 following
https://mega002.github.io/
megamor2
mega002
AI & ML interests
None yet
Recent Activity
authored
a paper
17 days ago
Universal Jailbreak Suffixes Are Strong Attention Hijackers
authored
a paper
22 days ago
Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization
authored
a paper
5 months ago
Open Problems in Mechanistic Interpretability
View all activity
Organizations
Papers
13
arxiv:
2506.12880
arxiv:
2506.10920
arxiv:
2501.16496
arxiv:
2501.08319
Expand 13 papers
models
0
None public yet
datasets
0
None public yet