Expected utility without the independence axiom

Stuart_Armstrong

28th Oct 2009

John von Neumann and Oskar Morgenstern developed a system of four axioms that they claimed any rational decision maker must follow. The major consequence of these axioms is that when faced with a decision, you should always act solely to increase your expected utility. All four axioms have been attacked at various times and from various directions; but three of them are very solid. The fourth - independence - is the most controversial.

To understand the axioms, let A, B and C be lotteries - processes that result in different outcomes, positive or negative, with a certain probability of each. For 0<p<1, the mixed lottery pA + (1-p)B means that you have probability p of being in lottery A, and probability (1-p) of being in lottery B. Writing A>B means that you prefer lottery A to lottery B, A<B is the reverse, and A=B means that you are indifferent between the two. The von Neumann-Morgenstern axioms are then:

(Completeness) For every A and B either A<B, A>B or A=B.
(Transitivity) For every A, B and C with A>B and B>C, then A>C.
(Continuity) For every A>B>C there exists a probability p with B=pA + (1-p)C.
(Independence) For every A, B and C with A>B, and for every 0<t≤1, then tA + (1-t)C > tB + (1-t)C.

In this post, I'll try to prove that even without the Independence axiom, you should continue to use expected utility in most situations. This requires some mild extra conditions, of course. The problem is that although these conditions are considerably weaker than Independence, they are harder to phrase. So please bear with me here.

The whole insight in this post rests on the fact that a lottery that has 99.999% chance of giving you £1 is very close to being a lottery that gives you £1 with certainty. I want to express this fact by looking at the narrowness of the probability distribution, using the standard deviation. However, this narrowness is not an intrinsic property of the distribution, but of our utility function. Even in the example above, if I decide that receiving £1 gives me a utility of one, while receiving zero gives me a utility of minus ten billion, then I no longer have a narrow distribution, but a wide one. So, unlike the traditional set-up, we have to assume a utility function as being given. Once this is chosen, this allows us to talk about the mean and standard deviation of a lottery.
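To make the dependence on the utility function concrete, here is a minimal sketch that computes the mean and standard deviation of the 99.999% lottery under the two utility assignments described above. The helper name `mean_and_sd` and the variable names are illustrative assumptions, not part of the original argument.

```python
import math

def mean_and_sd(outcomes):
    """Return (mean, standard deviation) of a lottery.

    `outcomes` is a list of (probability, utility) pairs whose
    probabilities sum to 1. Hypothetical helper for illustration.
    """
    mean = sum(p * u for p, u in outcomes)
    variance = sum(p * (u - mean) ** 2 for p, u in outcomes)
    return mean, math.sqrt(variance)

p_win = 0.99999

# Utility 1 for receiving £1, utility 0 for receiving nothing.
narrow = [(p_win, 1.0), (1 - p_win, 0.0)]

# Utility 1 for receiving £1, utility minus ten billion for receiving nothing.
wide = [(p_win, 1.0), (1 - p_win, -1e10)]

narrow_mean, narrow_sd = mean_and_sd(narrow)
wide_mean, wide_sd = mean_and_sd(wide)

print(narrow_mean, narrow_sd)  # mean close to 1, SD well below 0.01
print(wide_mean, wide_sd)      # large negative mean, SD in the tens of millions
```

The probabilities are identical in both cases; only the utilities attached to the outcomes differ, and that alone moves the lottery from narrow to wide.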

Then if you define c(μ) as the lottery giving you a certain return of μ, you can use the following axiom instead of independence:

(Standard deviation bound) For all ε>0, there exists a δ>0 such that for all μ>0, any lottery B with mean μ and standard deviation less than μδ has B>c((1-ε)μ).

This seems complicated, but all that it says, in mathematical terms, is that if we have a probability distribution that is "narrow enough" around its mean μ, then we should value it as being very close to a certain return of μ. The narrowness is expressed in terms of its standard deviation - a lottery with zero SD is a guaranteed return of μ, and as the SD gets larger, the distribution gets wider, and the chances of getting values far away from μ increase. So risk, in other words, scales (approximately) with the SD.

We also need to make sure that we are not risk loving - if we are inveterate gamblers for the point of being gamblers, our behaviour may be a lot more complicated.

(Not risk loving) If A has mean μ>0, then A≤c(μ).

I.e. we don't love a worse rate of return just because of the risk. This axiom can and maybe should be weakened, but it's a good approximation for the moment - most people are not risk loving with huge risks.

Assume you are going to have to choose n different times whether to accept independent lotteries with fixed mean β>0, and all with SD less than a fixed upper-bound K. Then if you are not risk loving (and you satisfy the standard deviation bound) and n is large enough, you must accept an arbitrarily large proportion of the lotteries.

Proof: From now on, I'll use a different convention for adding and scaling lotteries. Treating them as random variables, A+B will mean the lottery consisting of A and B together, while xA will mean the same lottery as A, but with all returns (positive or negative) scaled by x.

Let X₁, X₂, ... , X_n be these n independent lotteries, with means β and variances v_j. Then, since the standard deviations are less than K, the variances must be less than K².

Let Y = X₁ + X₂ + ... + X_n. The mean of Y is nβ. The variance of Y is the sum of the v_j, which is less than nK². Hence the SD of Y is less than K√(n). Now pick an ε>0, and the resulting δ>0 from the standard deviation bound axiom. For large enough n, nβδ must be larger than K√(n); hence, for large enough n, Y > c((1-ε)nβ). Now, if we were to refuse more than εn of the lotteries, we would be left with a distribution with mean ≤ (1-ε)nβ, which, since we are not risk loving, is no better than c((1-ε)nβ), which is worse than Y. Hence we must accept at least a proportion (1-ε) of the lotteries on offer. ♦
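The phrase "for large enough n" can be made explicit: nβδ > K√(n) holds exactly when n > (K/(βδ))². The sketch below computes the smallest such n. The function name `min_lotteries` is a hypothetical helper, and the value of δ is simply assumed here; in the argument above it comes from the agent's own preferences via the standard deviation bound axiom.

```python
import math

def min_lotteries(beta, K, delta):
    """Smallest integer n with n*beta*delta > K*sqrt(n).

    Rearranging gives n > (K / (beta * delta)) ** 2, so we take the
    first integer strictly above that threshold.
    """
    threshold = (K / (beta * delta)) ** 2
    return math.floor(threshold) + 1

# Assumed illustrative values: mean 2 per lottery, SD bound 10, delta 0.25.
n = min_lotteries(beta=2.0, K=10.0, delta=0.25)
print(n)
```

The threshold grows with the square of K/(βδ), so demanding a tighter δ, or facing lotteries that are noisier relative to their mean, raises the required number of lotteries quadratically.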

This only applies to lotteries that share the same mean, but we can generalise the result as:

Assume you are going to have to choose n different times whether to accept independent lotteries all with means greater than a fixed β>0, and all with SD less than a fixed upper-bound K. Then if you are not risk loving (and you satisfy the standard deviation bound) and n is large enough, you must accept lotteries whose means represent an arbitrarily large proportion of the total mean of all lotteries on offer.

Proof: The same proof works as before, with nβ now being a lower bound on the true mean μ of Y. Thus we get Y > c((1-ε)μ), and we must accept lotteries whose total mean is greater than (1-ε)μ. ♦
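The chain of inequalities behind this generalised proof can be spelled out as follows. This is a sketch: the symbol μ_j for the mean of the j-th lottery is notation introduced here, and n is assumed large enough that K√(n) < nβδ, as in the first proof.

```latex
\mu = \sum_{j=1}^{n} \mu_j \;\ge\; n\beta,
\qquad
\mathrm{SD}(Y) \;<\; K\sqrt{n} \;<\; n\beta\delta \;\le\; \mu\delta
\quad\Longrightarrow\quad
Y > c\bigl((1-\varepsilon)\mu\bigr).
```

The last step is the standard deviation bound axiom applied to Y, whose SD is now known to be less than μδ.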

Analysis: Since we rejected independence, we must now consider the lotteries when taken as a whole, rather than just seeing them individually. When considered as a whole, "reasonable" lotteries are more tightly bunched around their total mean than they are individually. Hence the more lotteries we consider, the more we should treat them as if only their mean mattered. So if we are not risk loving, and expect to meet many lotteries with bounded SD in our lives, we should follow expected utility. Deprived of independence, expected utility sneaks in via aggregation.
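The tightening under aggregation can be tabulated directly. In the equal-mean case the sum has mean nβ and SD below K√(n), so its SD relative to its mean is below K/(β√(n)). The sketch below evaluates that bound for a few values of n; the function name `relative_sd_bound` and the sample values of β and K are assumptions for illustration.

```python
import math

def relative_sd_bound(beta, K, n):
    """Upper bound on SD(Y) / mean(Y) for the sum Y of n independent
    lotteries, each with mean beta and SD below K."""
    return (K * math.sqrt(n)) / (n * beta)

# Assumed illustrative values: mean 1 per lottery, SD bound 10.
bounds = {n: relative_sd_bound(1.0, 10.0, n) for n in (1, 100, 10_000)}

for n, bound in bounds.items():
    print(n, bound)
```

A single lottery here may have an SD up to ten times its mean, while ten thousand of them together have an SD below a tenth of their total mean.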

Note: This restates the first half of my previous post - a post so confusingly written it should be staked through the heart and left to die on a crossroad at noon.

Edit: Rewrote a part to emphasise the fact that a utility function needs to be chosen in advance - thanks to Peter de Blanc and Nick Hay for bringing this up.

68 Comments

alyssavance

"1) I wasn't claiming that Allais is about risk aversion."

The difference between your preferences over choosing lottery A vs. lottery B when both are performed a million times, and your preferences over choosing A vs. B when both are performed once, is a measurement of your risk aversion; this is what Gray Area was talking about, is it not?

"Believe it or not, when I say, "I prefer B to A", it doesn't mean "I hereby legally obligate myself to redeem on demand any B for an A""

Then you must be using a different (and, I might add, quite unusual) definition of the word "preference". To quote dictionary.com:

pre⋅fer /prɪˈfɜr/ [pri-fur] –verb (used with object), -ferred, -fer⋅ring.

to set or hold before or above other persons or things in estimation; like better; choose rather than: to prefer beef to chicken.

What does it mean to say that you prefer B to A, if you wouldn't trade B for A if the trade is offered? Could I say that I prefer torture to candy, even if I always choose candy when the choice is offered to me?

Typo: Did you mean "prefer A to B"?

Psychohistorian

I prefer B to A does not imply I prefer 10B to 10A, or even I prefer 2B to 2A. Expected utility != expected return.

I agree pretty much completely with Silas. If you want to prove that people are money pumps, you need to actually get a random sample of people and then actually pump money out of them. You can't just take a single-shot hypothetical and extrapolate to other hypotheticals when the whole issue is how people deal with the variability of returns.

SilasBarta

No, it's not, and the problem asserted by Allais paradox is that the utility function is inconsistent, no matter what the risk preference. [...] I don't see anything in there about how many times the choice has to happen, which is the very issue at stake. If there's any unusualness, it's definitely on your side. When you buy a chocolate bar for a dollar, that "preference of a chocolate bar to a dollar" does not somehow mean that you are willing to trade every dollar you have for a chocolate bar, nor have you legally obligated yourself to redeem chocolate bars for dollars on demand (as a money pump would require), nor does anyone expect that you will trade the rest of your dollars this way. It's called diminishing marginal utility. In fact, it's called marginal analysis in general. [...] It means you would trade B for A on the next opportunity to do so, not that you would indefinitely do it forever, as the money pump requires.