Exploring the Residual Stream of Transformers for Mechanistic Interpretability — Explained
— by Zeping Yu, Dec 24, 2023 In this post, I present the paper “Exploring the Residual Stream of Transformers”. We aim to locate important parameters containing knowledge for next word prediction, and find the mechanism of the model to merge the knowledge into the final embedding. For sentence “The...