Matt Levinson

Activation Magnitudes Matter On Their Own: Insights from Language Model Distributional Analysis

In my previous post, I explored the distributional properties of transformer activations, finding that they follow mixture distributions dominated by logistic-like or even heavier-tailed primary components, with minor modes in the tails and sometimes in the shoulders. Note that I have entirely ignored dimension here, treating each value in...

Jan 10, 2025 · 4

Beyond Gaussian: Language Model Representations and Distributions

In January 2023, beren and Eric Winsor cataloged basic distributional properties of weights, activations, and gradients in GPT-2 models, providing a systematic view of model internals (thanks to Ryan Greenblatt for the pointer). This post extends their investigation in two directions. First, examining their characterization of transformer activations as "nearly...

Nov 24, 2024 · 6
