Roger Dearnaley

Message

Roger Dearnaley

Message

Just How Hard a Problem is Alignment?

It is commonly asserted that aligning AI is extremely hard because 1. human values are complex: they have a high Kolmogorov complexity, and 2. they're fragile: if you get them even a tiny bit wrong, the result is useless, or worse than useless. If these statements are both true, then...

Feb 25, 2023•3

Breaking the Optimizer’s Curse, and Consequences for Existential Risks and Value Learning

Multi-Armed Bandits Considered Harmful People frequently analyze the process of artificial agents gathering knowledge in the framework of explore/exploit strategies for multi-armed bandits. However, a multi-armed bandit is a simplistic black-box abstraction – the possible rewards from pulling each arm have no underlying logic: by definition they’re unknown and unknowable...

Feb 21, 2023•10

Roger Dearnaley

;

Just How Hard a Problem is Alignment?

Feb 25, 2023•3

Breaking the Optimizer’s Curse, and Consequences for Existential Risks and Value Learning

Feb 21, 2023•10

LESSWRONG
LW

LESSWRONG
LW

Roger Dearnaley

Roger Dearnaley

Roger Dearnaley

Roger Dearnaley

Just How Hard a Problem is Alignment?

Breaking the Optimizer’s Curse, and Consequences for Existential Risks and Value Learning

Just How Hard a Problem is Alignment?

Breaking the Optimizer’s Curse, and Consequences for Existential Risks and Value Learning