This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
is fundraising!
Tags
LW
$
Login
Myopia
•
Applied to
Non-myopia stories
1y
ago
•
Applied to
How LLMs are and are not myopic
by
janus
1y
ago
•
Applied to
"Corrigibility at some small length" by dath ilan
by
Christopher King
2y
ago
•
Applied to
GPT-4 busted? Clear self-interest when summarizing articles about itself vs when article talks about Claude, LLaMA, or DALL·E 2
by
Christopher King
2y
ago
•
Applied to
A crazy hypothesis: GPT-4 already is agentic and is trying to take over the world!
by
Christopher King
2y
ago
•
Applied to
GPT-4 aligning with acasual decision theory when instructed to play games, but includes a CDT explanation that's incorrect if they differ
by
Christopher King
2y
ago
•
Applied to
Underspecification of Oracle AI
by
Evan R. Murphy
2y
ago
•
Applied to
You can still fetch the coffee today if you're dead tomorrow
by
davidad
2y
ago
•
Applied to
Steering Behaviour: Testing for (Non-)Myopia in Language Models
by
Evan R. Murphy
2y
ago
•
Applied to
Simulators
by
Evan R. Murphy
2y
ago
•
Applied to
Limiting an AGI's Context Temporally
by
Noosphere89
2y
ago
•
Applied to
Generative, Episodic Objectives for Safe AI
by
Michael Glass
2y
ago
•
Applied to
Laziness in AI
by
RobertM
2y
ago
•
Applied to
Acceptability Verification: A Research Agenda
by
David Udell
2y
ago
•
Applied to
Interpretability’s Alignment-Solving Potential: Analysis of 7 Scenarios
by
Evan R. Murphy
3y
ago
•
Applied to
AI safety via market making
by
Evan R. Murphy
3y
ago
•
Applied to
How complex are myopic imitators?
by
Vivek Hebbar
3y
ago