x

LESSWRONG
LW

Henrik Åslund — LessWrong

Henrik Åslund

Henrik Åslund

Message

18

Ω

8

1

1

7y

Henrik Åslund

18

Ω

8

7y

;

Corrigibility as Constrained Optimisation

This post is coauthored with Ryan Carey. Much of the work on developing a corrigible agent has focused on ensuring that an AI will not manipulate the shutdown button or any other kind of device that the human operator would use to control it. Suppose, however, that the AI lacked...

Apr 11, 2019•15