Some OthelloGPT Circuits
tl;dr * I trained OthelloGPT-small, a 600k param model that predicts legal next moves in 6x6 Othello with 99.3% accuracy * I trained attention SAEs and transcoders on every module output with 94%+ loss recovered * I used the learned features and the static model weights to generate a computational...
Apr 15, 20257