This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
is fundraising!
LW
$
Login
Jannik Brinkmann
Posts
Sorted by New
38
Evaluating Sparse Autoencoders with Board Game Models
5mo
1
74
Interpreting Preference Models w/ Sparse Autoencoders
Ω
6mo
Ω
12
50
Finding Backward Chaining Circuits in Transformers Trained on Tree Search
7mo
1
26
Improving SAE's by Sqrt()-ing L1 & Removing Lowest Activating Features
Ω
9mo
Ω
5
Wiki Contributions
Comments
Sorted by
Newest