Tracing Typos in LLMs: My Attempt at Understanding How Models Correct Misspellings
This blogpost was created as a part of the AI Safety Fundamentals course by BlueDot Impact. All of the code can be found on my GitHub. TLDR: I tried to uncover if there are specific components in language models that enable typo correction. I identified a subword merging head in...