This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
is fundraising!
Tags
LW
$
Login
Mesa-Optimization
•
Applied to
AXRP Episode 38.3 - Erik Jenner on Learned Look-Ahead
by
DanielFilan
10d
ago
•
Applied to
Why Recursive Self-Improvement Might Not Be the Existential Risk We Fear
by
Nassim_A
1mo
ago
•
Applied to
Principled Satisficing To Avoid Goodhart
by
JenniferRM
4mo
ago
•
Applied to
Pacing Outside the Box: RNNs Learn to Plan in Sokoban
by
Adrià Garriga-alonso
5mo
ago
•
Applied to
Finding Backward Chaining Circuits in Transformers Trained on Tree Search
by
abhayesian
7mo
ago
•
Applied to
Inner Optimization Mechanisms in Neural Nets
by
ProgramCrafter
7mo
ago
•
Applied to
The Human's Role in Mesa Optimization
by
silentbob
7mo
ago
•
Applied to
Visualizing neural network planning
by
Nevan Wichers
7mo
ago
•
Applied to
Measuring Learned Optimization in Small Transformer Models
by
J Bostock
8mo
ago
•
Applied to
Understanding mesa-optimization using toy models
by
tilmanr
9mo
ago
•
Applied to
Counting arguments provide no evidence for AI doom
by
Quintin Pope
10mo
ago
•
Applied to
The Inner Alignment Problem
by
Jakub Halmeš
10mo
ago
•
Applied to
Satisficers want to become maximisers
by
JenniferRM
1y
ago
•
Applied to
Mesa-Optimization: Explain it like I'm 10 Edition
by
brook
1y
ago