RobbBB comments on Why you must maximize expected utility - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (75)
I appreciate the hard work here, but all the math sidesteps the real problems, which are in the axioms, particularly the axiom of independence. See this sequence of comments on my post arguing that saying expectation maximization is correct is equivalent to saying that average utilitarianism is correct.
People object to average utilitarianism because of certain "repugnant" scenarios, such as the utility monster (a single individual who enjoys torturing everyone else so much that it's right to let him or her do so). Some of these scenarios can be transformed into a repugnant scenario for expectation maximization over your own utility function, where instead of "one person" you have "one possible future you". Suppose the world has one billion people. Do you think it's better to give one billion and one utilons to one person than to give one utilon to everyone? If so, why would you believe it's better to take an action that results in you having one billion and one utilons one-one-billionth of the time, and nothing all other times, than an action that reliably gives you one utilon?
The way people think about the lottery suggests that most people prefer to distribute utilons equally among different people, but to lump them together and give them to a few winners in distributions among their possible future selves. This is a case where we reliably violate the Golden Rule, and call ourselves virtuous for doing so.
One possible response is that the former action is preferable, but the intuition pump yields a different result because our intuitions are informed by actual small and large rewards (e.g., money), and in the real world getting $1 every day for eight years with certainty does not have the same utility as getting $2922 with probability 1/2922 each day for the next eight years. If real-world examples like money -- which is almost always more valuable now than later, inflation aside; and which bears hidden and nonlinearly changing utilities like 'security' and 'versatility' and 'social status' and 'peace of mind' that we learn to reason with intuitively as though they could not be quantified in a single utility metric analogous to the currency measure itself -- are the only intuitive grasp we have on 'utilons,' then we may make systematic errors in trying to cash out how our values would, if we better understood our biases, be reflectively cashed out.