x

LESSWRONG
is fundraising!
LW

Jannik Brinkmann — LessWrong

Jannik Brinkmann

140000

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by

No Comments Found

No wikitag contributions to display.

38Evaluating Sparse Autoencoders with Board Game Models

1y

1

75Interpreting Preference Models w/ Sparse Autoencoders

1y

12

52Finding Backward Chaining Circuits in Transformers Trained on Tree Search

2y

1

26Improving SAE's by Sqrt()-ing L1 & Removing Lowest Activating Features

2y

5