Can coherent extrapolated volition be estimated with Inverse Reinforcement Learning?
Given the following conditions, is it possible to approximate the coherent extrapolated volition of humanity to a "good enough" level?
* Some form of reward/cost function estimation is used, such as inverse reinforcement learning or inverse optimal control. The details of the specific IRL/IOC algorithm in question are not important,...
Apr 15, 2019