wedrifid comments on Holden's Objection 1: Friendliness is dangerous - Less Wrong

11 Post author: PhilGoetz 18 May 2012 12:48AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (428)

You are viewing a single comment's thread. Show more comments above.

Comment author: wedrifid 26 May 2012 02:45:54AM 0 points [-]

If a majority of humanity wishes to kill a minority, obviously there won't be a consensus to stop the killing, and AI will not interfere.

This assumes that CEV uses something along the lines of a simulated vote as an aggregation mechanism. Currently the method of aggregation is undefined so we can't say this with confidence - certainly not as something obvious.

Comment author: DanArmak 26 May 2012 08:29:36AM 0 points [-]

I agree. However, if the CEV doesn't privilege any value separately from how many people value it how much (in EV), and if the EV of a large majority values killing a small minority (whose EV is of course opposed), and if you have protection against both positive and negative utility monsters (so it's at least not obvious and automatic that the negative value of the minority would outweigh the positive value of the majority) - then my scenario seems to me to be plausible, and an explanation is necessary as to how it might be prevented.

Of course you could say that until CEV is really formally specified, and we know how the aggregation works, this explanation cannot be produced.

Comment author: wedrifid 26 May 2012 11:43:22AM 0 points [-]

my scenario seems to me to be plausible, and an explanation is necessary as to how it might be prevented.

Absolutely, on both counts.