Kaj_Sotala comments on Open Thread Feb 22 - Feb 28, 2016 - Less Wrong

5 Post author: Elo 21 February 2016 09:14PM

Comments (228)

Comment author: halcyon 24 February 2016 02:24:17PM 0 points

Interesting. In that case, would you say an AI that provably implements CEV's replacement is, for that reason, provably Friendly? That is, AIs implementing CEV's replacement form an analytical subset of Friendly AIs? What is the current replacement for CEV anyway? Having some technical material would be even better. If it's open to the public, then I'd like to understand how EY proposes to install a general framework similar to CEV at the "initial dynamic" stage that can predictably generate a provably Friendly AI without explicitly modeling the target of its Friendliness.

Comment author: Kaj_Sotala 26 February 2016 05:45:29PM 1 point

What is the current replacement for CEV anyway?

There isn't really one as far as I know; "The Value Learning Problem" discusses some of the questions involved, but seems to mostly be at the point of defining the problem rather than trying to answer it. (This seems appropriate to me; trying to answer the problem at this point seems premature.)

Comment author: halcyon 28 February 2016 03:32:58PM 1 point

Thanks. That makes sense to me.