GPT
Interpretability (ML & AI)
Language Models (LLMs)
Machine Learning (ML)
AI
Language models can explain neurons in language models
by nz, 9th May 2023
AI Alignment Forum
This is a linkpost for https://openai.com/research/language-models-can-explain-neurons-in-language-models