submarat

SWE/MLE/AI Safety

Transformers Don't Need LayerNorm at Inference Time: Implications for Interpretability

This work was produced during MARS and SPAR. An arXiv version is available at https://arxiv.org/abs/2507.02559, with code on GitHub and models on HuggingFace.

TL;DR: We scaled LayerNorm (LN) removal by fine-tuning to GPT-2 XL:
* We improve training stability by regularizing activation standard deviation across token positions and improve the training code.
* ...

Jul 23, 2025 · 31

ARENA4.0 Capstone: Hyperparameter tuning for MELBO + replication on Llama-3.2-1b-Instruct

Epistemic status: My coauthor and I are both noobs in this field. Expect errors and conceptual flaws.

tl;dr: For a four-day capstone project for the ARENA program, my partner and I replicated the MELBO LessWrong article using Llama-3.2-1b-Instruct.

Intro: I've been spending the last month at the...

Oct 5, 2024 · 34

LESSWRONG