AI & ML interests

Principled evaluation of mechanistic interpretability methods.

Recent Activity

shunshao  updated a Space 14 days ago
mib-bench/leaderboard
atticusg  updated a Space 15 days ago
mib-bench/leaderboard
atticusg  updated a Space 15 days ago
mib-bench/leaderboard
View all activity