steven0461 comments on CEV-inspired models - Less Wrong Discussion

7 Post author: Stuart_Armstrong 07 December 2011 06:35PM

Comments (41)

Comment author: steven0461 07 December 2011 08:17:12PM 4 points

"one cheap and easy method (with surprisingly good properties) is to take the maximal possible expected utility (the expected utility that person would get if the AI did exactly what they wanted) as 1, and the minimal possible expected utility (if the AI was to work completely against them) as 0"

If Alice likes cookies, and Bob likes cookies but hates whippings, this method gives Alice more cookies than Bob. Moreover, the number of bonus cookies Alice gets depends on the properties of whips that nobody ever uses.
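To make the objection concrete, here is a minimal sketch of the quoted normalisation applied to a toy world. Everything specific in it is an assumption for illustration and not part of the original post: the square-root utility for cookies, the fixed cookie total, the single "Bob gets whipped" option, and the names `normalise` and `best_outcome`. The AI is assumed to maximise the plain sum of the normalised utilities.

```python
import math

def normalise(utility, outcomes):
    """Rescale a utility so its worst outcome maps to 0 and its best to 1."""
    lo = min(utility(o) for o in outcomes)
    hi = max(utility(o) for o in outcomes)
    return lambda o: (utility(o) - lo) / (hi - lo)

def best_outcome(total_cookies, whip_pain):
    """Return the outcome maximising the sum of normalised utilities.

    An outcome is (alice_cookies, bob_cookies, bob_whipped).
    whip_pain is a hypothetical parameter: how much utility a whipping costs Bob.
    """
    outcomes = [(a, total_cookies - a, whipped)
                for a in range(total_cookies + 1)
                for whipped in (False, True)]

    # Assumed utilities: both enjoy cookies with diminishing returns;
    # only Bob cares about being whipped.
    def alice(o):
        return math.sqrt(o[0])

    def bob(o):
        return math.sqrt(o[1]) - (whip_pain if o[2] else 0.0)

    norm_alice = normalise(alice, outcomes)
    norm_bob = normalise(bob, outcomes)
    return max(outcomes, key=lambda o: norm_alice(o) + norm_bob(o))

if __name__ == "__main__":
    for pain in (0.0, 10.0, 30.0):
        alice_cookies, bob_cookies, whipped = best_outcome(100, pain)
        print(pain, alice_cookies, bob_cookies, whipped)
```

Under these assumptions nobody is ever whipped in the chosen outcome, yet Alice's share of the cookies grows as `whip_pain` grows, because a worse worst case stretches Bob's utility range and so shrinks the normalised value of each cookie to him.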

Comment author: Vladimir_Nesov 08 December 2011 04:29:14PM 2 points

(In general, it's proper for properties of counterfactuals to have impact on which decisions are correct in reality, so this consideration alone isn't sufficient to demonstrate that there's a problem.)

Comment author: Stuart_Armstrong 08 December 2011 01:23:34PM 1 point

You can restrict to a Pareto boundary before normalising - not as mathematically elegant, but indifferent to effects "that nobody ever wants/uses".
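A minimal sketch of the Pareto-restricted variant, in a toy cookies-and-whips world. The utilities, the cookie total, the single whipping option and the function names (`utilities`, `best_on_frontier`) are illustrative assumptions, not anything specified in the thread. The only change from plain normalisation is that dominated outcomes are discarded before each utility's 0 and 1 points are computed.

```python
import math

def utilities(outcome, whip_pain):
    """Assumed utilities for (alice_cookies, bob_cookies, bob_whipped)."""
    alice_cookies, bob_cookies, bob_whipped = outcome
    alice = math.sqrt(alice_cookies)
    bob = math.sqrt(bob_cookies) - (whip_pain if bob_whipped else 0.0)
    return (alice, bob)

def best_on_frontier(total_cookies, whip_pain):
    """Normalise over the Pareto frontier only, then maximise the sum."""
    outcomes = [(a, total_cookies - a, whipped)
                for a in range(total_cookies + 1)
                for whipped in (False, True)]
    scored = [(o, utilities(o, whip_pain)) for o in outcomes]

    # u is dominated if some other utility vector is at least as good for
    # everyone and differs from u (hence strictly better for someone).
    def dominated(u):
        return any(v != u and all(x >= y for x, y in zip(v, u))
                   for _, v in scored)

    frontier = [(o, u) for o, u in scored if not dominated(u)]

    # The 0 and 1 points now come from the frontier, not from all outcomes.
    lows = [min(u[i] for _, u in frontier) for i in range(2)]
    highs = [max(u[i] for _, u in frontier) for i in range(2)]

    def total(u):
        return sum((u[i] - lows[i]) / (highs[i] - lows[i]) for i in range(2))

    return max(frontier, key=lambda pair: total(pair[1]))[0]

if __name__ == "__main__":
    for pain in (0.0, 10.0, 30.0):
        print(pain, best_on_frontier(100, pain))
```

In this toy setup every whipping outcome with positive `whip_pain` is dominated by the same cookie split without the whipping, so it never reaches the frontier and the chosen split no longer depends on how much whips hurt.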