I said this in a comment on Real-life entropic weirdness, but it's getting off-topic there, so I'm posting it here.
My original writeup was confusing, because I used some non-standard terminology, and because I wasn't familiar with the crucial theorem. We cleared up the terminological confusion (thanks esp. to conchis and Vladimir Nesov), but the question remains. I rewrote the title yet again, and have here a restatement that I hope is clearer.
- We have a utility function u(outcome) that gives a utility for one possible outcome. (Note the word utility. That means your diminishing marginal utility, and all your preferences, and your aggregation function for a single outcome, are already incorporated into this function. There is no need to analyze u further, as long as we agree on using a utility function.)
- We have a utility function U(lottery) that gives a utility for a probability distribution over all possible outcomes.
- The von Neumann-Morgenstern theorem says that, given 4 reasonable axioms about U, the only form U can take consistent with those axioms is the expected value of u(outcome) over all possible outcomes. This is why we constantly talk on LW about rationality as maximizing expected utility.
- This means that your utility function U is indifferent to whether the distribution of utility is equitable among your future selves. Giving one future self u=10 and another u=0 is exactly as good as giving each of them u=5.
- This is the same ethical judgement that an average utilitarian makes when they say that, to calculate social good, we should calculate the average utility of the population; modulo the problems that population can change and that not all people are equal. This is clearer if you use a many-worlds interpretation, and think of maximizing expected value over possible futures as applying average utilitarianism to the population of all possible future yous.
- Therefore, I think that, if the 4 axioms are valid when calculating U(lottery), they are probably also valid when calculating not our private utility, but a social utility function s(outcome), which aggregates over people in the same way that U(lottery) aggregates over possible worlds (see the numerical sketch after this list). The theorem then shows that we should set s(outcome) = the average of the utilities of the different people involved. (In other words, average utilitarianism is correct.) Either that, or the axioms are inappropriate for both U and s, and we should not define rationality as maximizing expected utility.
- (I am not saying that the theorem reaches down through U to say anything directly about the form of u(outcome). I am saying that choosing a shape for U(lottery) is the same type of ethical decision as choosing a shape for s(outcome); and the theorem tells us what U(lottery) should look like; and if that ethical decision is right for U(lottery), it should also be right for s(outcome).)
- And yet, average utilitarianism asserts that equity of utility, even among equals, has no utility. This is shocking, especially to Americans.
- It is even more shocking that it is thus possible to prove, given reasonable assumptions, which type of utilitarianism is correct. One then wonders what other seemingly arbitrary ethical valuations actually have provable answers given reasonable assumptions.
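To make the parallel concrete, here is a minimal numerical sketch (the outcomes, probabilities, and utility values are invented purely for illustration). It shows that U(lottery) over possible futures and s(outcome) under average utilitarianism are the same weighted-average calculation:

```python
# Invented numbers for illustration only.

# U(lottery): probability-weighted average of u(outcome) over possible futures.
lottery_unequal = [(0.5, 10), (0.5, 0)]   # (probability, u) pairs: one future self gets 10, the other 0
lottery_equal   = [(0.5, 5), (0.5, 5)]    # both future selves get 5

U = lambda lottery: sum(p * u for p, u in lottery)
print(U(lottery_unequal), U(lottery_equal))   # 5.0 5.0 -- U is indifferent between them

# s(outcome): average utilitarianism over a population in a single outcome.
population = [10, 0]                      # u values for two people
s = sum(population) / len(population)
print(s)                                  # 5.0 -- the same arithmetic, with weight 1/n per person
```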
Some problems with average utilitarianism from the Stanford Encyclopedia of Philosophy:
Despite these advantages, average utilitarianism has not obtained much acceptance in the philosophical literature. This is due to the fact that the principle has implications generally regarded as highly counterintuitive. For instance, the principle implies that for any population consisting of very good lives there is a better population consisting of just one person leading a life at a slightly higher level of well-being (Parfit 1984 chapter 19). More dramatically, the principle also implies that for a population consisting of just one person leading a life at a very negative level of well-being, e.g., a life of constant torture, there is another population which is better even though it contains millions of lives at just a slightly less negative level of well-being (Parfit 1984). That total well-being should not matter when we are considering lives worth ending is hard to accept. Moreover, average utilitarianism has implications very similar to the Repugnant Conclusion (see Sikora 1975; Anglin 1977).
(If you assign different weights to the utilities of different people, you could probably get the same result by treating a person with weight W as equivalent to W copies of a person with weight 1.)
I haven't done the math, so take the following with a grain of salt.
We humans care about what will happen in the future. We care about how things will turn out. Call each possible future an "outcome". We humans prefer some outcomes over others. We ought to steer the future towards the outcomes we prefer. Mathematically, we have a (perhaps partial) order on the set of outcomes, and if we had perfect knowledge of how our actions affected the future, our decision procedure would just be "pick the best outcome".
So far I don't think I've said anything controversial.
But we don't have perfect knowledge of the future. We must reason and act under uncertainty. The best we can hope to do is assign conditional probabilities to outcomes based on our possible actions. But in order to choose actions based on probabilities rather than certainties, we have to know a little more about what our preferences actually are. It's not enough to know that one outcome is better than another; we have to know how much better. Let me give an example. If you are given the choice between winning a little money with probability .1 and a lot of money with probability .01, which option do you choose? Well, I haven't given you enough information. If "a little" is $1 and "a lot" is $1 million, you should go for "a lot". But if "a lot" is only $2, you're better off going for "a little".
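Working out the expected values in that example (using dollars as a stand-in for utility, which is a simplification, since money itself has diminishing marginal utility):

```python
# Expected dollar value of each option in the example above.
little    = 0.1  * 1          # "a little" = $1 at probability .1       -> 0.10
lot_big   = 0.01 * 1_000_000  # "a lot" = $1 million at probability .01 -> 10000.0
lot_small = 0.01 * 2          # "a lot" = only $2 at probability .01    -> 0.02

print(little, lot_big, lot_small)   # 0.1 10000.0 0.02
```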
So it turns out that if you want to have a consistent decision theory under these circumstances, there's only one form it can take. Instead of a partial order on outcomes, you have to express your preference for each outcome as a number, called the utility of that outcome. And instead of selecting the action that leads to the best outcome, you select the action that leads to the highest expected utility.
The function that maps outcomes to utilities is called the utility function, and it is unique (given your preferences) up to a positive affine transformation. In other words, you can multiply the whole utility function by a positive scalar, or add any number to it, and its meaning does not change.
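A quick way to see the "unique up to a positive affine transformation" claim: rescaling the utility function by a positive factor and shifting it by a constant never changes which action has the highest expected utility, because E[a·u + b] = a·E[u] + b, and positive a with any b preserves the ordering. A small sketch with invented numbers:

```python
# Two hypothetical actions, each a list of (probability, utility) pairs.
action_a = [(0.9, 1.0), (0.1, 0.0)]
action_b = [(0.5, 2.0), (0.5, 0.0)]

def expected_utility(action, transform=lambda u: u):
    return sum(p * transform(u) for p, u in action)

# Original utilities: action_b has the higher expected utility (1.0 vs 0.9).
print(expected_utility(action_a), expected_utility(action_b))

# Apply the positive affine transformation u -> 3*u + 7: the ranking is unchanged.
scaled = lambda u: 3 * u + 7
print(expected_utility(action_a, scaled), expected_utility(action_b, scaled))  # 9.7 10.0
```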
So you see, "maximize expected utility" doesn't mean you have to maximize say, your profit, or your personal happiness, or even the number of human lives you save. All it really means is that your preferences ought to be consistent, because if they are, and if you're trying the best you can to steer the future towards them, then "maximizing expected utility" is what you're already doing.