You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

ChristianKl comments on Open Thread Feb 22 - Feb 28, 2016 - Less Wrong Discussion

5 Post author: Elo 21 February 2016 09:14PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (228)

You are viewing a single comment's thread. Show more comments above.

Comment author: halcyon 23 February 2016 10:49:42PM *  2 points [-]

The SEP says that preferences cannot be aggregated without additional constraints on how the aggregation is to be done, and the end result changes depending on things like the order of aggregation, so these additional constraints take on the quality of arbitrariness. How does CEV get around that problem?

Comment author: ChristianKl 24 February 2016 12:46:30AM 1 point [-]

I think that's on the list of MIRI open research problems.

Comment author: halcyon 24 February 2016 02:24:17PM 0 points [-]

Interesting. In that case, would you say an AI that provably implements CEV's replacement is, for that reason, provably Friendly? That is, AIs implementing CEV's replacement form an analytical subset of Friendly AIs? What is the current replacement for CEV anyway? Having some technical material would be even better. If it's open to the public, then I'd like to understand how EY proposes to install a general framework similar to CEV at the "initial dynamic" stage that can predictably generate a provably Friendly AI without explicitly modeling the target of its Friendliness.

Comment author: Kaj_Sotala 26 February 2016 05:45:29PM 1 point [-]

What is the current replacement for CEV anyway?

There isn't really one as far as I know; "The Value Learning Problem" discusses some of the questions involved, but seems to mostly at be the point of defining the problem rather than trying to answer it. (This seems appropriate to me; trying to answer the problem at this point seems premature.)

Comment author: halcyon 28 February 2016 03:32:58PM 1 point [-]

Thanks. That makes sense to me.

Comment author: ChristianKl 24 February 2016 02:38:55PM 1 point [-]

Interesting. In that case, would you say an AI that provably implements CEV's replacement is, for that reason, provably Friendly?

I think that's MIRI's usage of the term friendly.

If it's open to the public, then I'd like to understand how EY proposes to install a general framework similar to CEV at the "initial dynamic" stage

He's not proposing a mechanism as far as I know. That's another open problem.

Comment author: Gunnar_Zarncke 24 February 2016 09:30:25PM -1 points [-]

See Miris research for details.