AI & ML interests

Principled evaluation of mechanistic interpretability methods.

Recent Activity

shunshao updated a Space 10 days ago
mib-bench/leaderboard
atticusg updated a Space 11 days ago
mib-bench/leaderboard