Stuart_Armstrong comments on Heroin model: AI "manipulates" "unmanipulatable" reward - Less Wrong

6 Post author: Stuart_Armstrong 22 September 2016 10:27AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (10)

You are viewing a single comment's thread. Show more comments above.

Comment author: chron 22 September 2016 06:57:01PM 2 points [-]

Well in a sense U(++,-) itself contradicts μ. After all in when given heroin the human seeks it out and acquires more utility than not seeking it out, why doesn't the human seek it out volunterily?

Comment author: Stuart_Armstrong 23 September 2016 09:52:08AM 1 point [-]

Replace "force the human to take heroin" with "gives the human a single sock" and "the human subsequently seeks out heroin" with "the human subsequently seeks out another sock". The formal structure of this can correspond to something quite acceptable.