This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
is fundraising!
Mechanistic Interpretability Puzzles
LW
$
Login
Mechanistic Interpretability Puzzles
66
Mech Interp Puzzle 1: Suspiciously Similar Embeddings in GPT-Neo
Ω
Neel Nanda
1y
Ω
15
40
Mech Interp Puzzle 2: Word2Vec Style Embeddings
Ω
Neel Nanda
1y
Ω
4