x

LESSWRONG
LW

Jade Bishop

Jade Bishop

Message

10

1

4

7y

Jade Bishop

10

7y

;

Jade Bishop — LessWrong

Can coherent extrapolated volition be estimated with Inverse Reinforcement Learning?

Given the following conditions, is it possible to approximate the coherent extrapolated value of humanity to a "good enough" level?: * Some form of reward/cost function estimation is used, such as inverse reinforcement learning or inverse optimal control. The details of the specific IRL/IOC algorithm in question are not important,...

Apr 15, 2019•12