This post slightly improves on the impossibility results in my last post and A paradox for tiny probabilities and enormous values. This is very similar to those arguments but with minor technical differences. I may refer people here if we're discussing details of the argument against unbounded utilities, but for more interesting discussion you should check out those other writeups.
Summarizing the key differences from my last post:
- I replace the dominance principle with .
- Assuming unbounded positive and negative utilities, I remove the assumption of Intermediate Mixtures.
Weakening dominance
We'll represent preferences by a relation over probability distributions over some implicit space of outcomes (and we'll identify outcomes with the constant probability distribution). Define to mean ( and not ).
We'll show that it's impossible for to satisfy four properties: unbounded utilities, intermediate mixtures, very weak dominance, and transitivity.
The properties
Unbounded utilities: there is an infinite sequence of outcomes each "more than twice as good"[1] as the last. More formally, there exists an outcome such that:
- for every
That is: is not as good as a chance of , which is not as good as a chance of , which is not as good as a chance of ...
Intermediate mixtures: If and ,[2] then .
That is: a chance of a good outcome is strictly worse than a sure thing.[3]
Weak lottery-lottery dominance: Let , and be sequences of lotteries, and be a sequence of probabilities summing to 1. If for all , then .
Transitivity: if and , then .[4]
Inconsistency proof
Consider the lottery , where each outcome is as likely as the one before. Intuitively this sum is "more infinite" than the usual St Petersburg lottery and this will be important for our proof.
By weak dominance we can replace by without making the lottery any better. Thus:
We can rewrite this new lottery as a mixture:
We can apply bounded utilities to each parenthesized expression. Combining with weak dominance, we obtain:
Write for the right hand side of this inequality.
Now note that , despite the fact that . If we knew , then intermediate mixtures would imply that , contradicting .
So all that remains is to show that . This will involve a bit of annoying arithmetic but it should feel pretty obvious given that all the outcomes in seems much better than .
Combining with weak dominance and we get:
Then we write the right hand side as a mixture and apply unbounded utilities and weak dominance:
Leveraging negative utilities
To rule out unbounded utilities, we've made two substantive consistency assumptions: intermediate mixtures and weak lottery-lottery dominance. The assumption of intermediate mixtures is necessary: we could satisfy the other properties by simply being indifferent between all lotteries with infinite expectations.
But if we can have unboundedly good or unboundedly bad outcomes, then we can obtain a contradiction even without infinite mixtures. That is, we will show that there is no relation satisfying transitivity, symmetric unbounded utilities, and weak outcome-lottery dominance.
I think that almost anyone who accepts unbounded utilities (in the informal sense) should accept symmetric unbounded utilities, so I think they probably need to reject weak outcome-lottery dominance (or stop defining preferences over arbitrary probability distributions). I find this pretty damning, but maybe others are more comfortable with it.
The properties
Transitivity. If and , then .
Symmetric unbounded utilities: There is a pair of outcomes (i.e. it's not the case that all pairs of outcomes are either incomparable or equal). Moreover:
- For any[5] pair of outcomes , there is outcome such that .
- For any pair of outcomes there is an outcome such that .
In words: no matter how much better is than , there's always an that's 2x "further away" from on the other side. By that we mean that a risk of moving from to can offset a chance of moving from to . And no matter how bad an outcome is, there's always an that's 2x "further away" from on the other side.
Weak outcome-lottery dominance: Let be an outcome, let be a sequence of lotteries, and let be a sequence of probabilities summing to 1. If for all , then . Similarly, if for all , then .
Inconsistency proof
Define a sequence of outcomes as follows:
- Pick arbitrarily.
- Take to be the "reflection" of across as defined in symmetric unbounded utilities.
- Take to be the reflection of across .
- Take to be the reflection of across .
- Take to be the reflection of across .
- And so on.
Now define the lottery
We can write as the mixture:
By Unbounded Utilities each of these terms is . So by weak dominance,
But we can also write as the mixture:
By Unbounded Utilities each of these terms is . So by weak dominance, .
Now we have . By transitivity, , contradicting .
- ^
Of course the same argument would work if we replaced "good" with "bad."
- ^
Note that we only actually need to apply this principle for and so a reader squeamish about very small probabilities need not be concerned. By making the argument with different numbers we could probably just fix .
- ^
Despite appearing innocuous, this might be more "controversial" than very weak dominance. Many theories say that if is infinitely good, then it doesn't matter whether I achieve with 100% probability or 50% probability (or 1% probability or 0.000001% probability...) I find this sufficiently unappealing to reject such theories out of hand, but each reader must pick their own poison.
- ^
I'm pretty sure this isn't necessary for the proof, since we only use it as a convenience for short manipulations rather than to establish very long chains. That said, it makes the proof easier and it's a pretty mild assumption.
- ^
We don't really need this universal quantifier---it would be enough to define a single chain of escalating and alternative outcomes. But the quantitative properties we need out of the chain are somewhat arbitrary, so it seemed clearer to state an axiom capturing why unbounded utilities allow us to construct a particular kind of chain, rather than to directly posit a chain. That said, it results in a stronger assumption---for example it may be that there is an outcome which can offset any negative outcome, while still having a chain of increasingly large finite outcomes.
This is an interesting theorem which helps illuminate the relationship between unbounded utilities and St Petersburg gambles. I particularly appreciate that you don't make an explicit assumption that the values of gambles must be representable by real numbers which is very common, but unhelpful in a setting like this. However, I do worry a bit about the argument structure.
The St Petersburg gamble is a famously paradox-riddled case. That is, it is a very difficult case where it isn't clear what to say, and many theories seem to produce outlandish results. When this happens, it isn't so impressive to say that we can rule out an opposing theory because in that paradox-riddled situation it would lead to strange results. It strikes me as similar to saying that a rival theory leads to strange result in variable population-size cases so we can reject it (when actually, all theories do), or that it leads to strange results in infinite population cases (when again, all theories do).
Even if one had a proof that an alternative theory doesn't lead to strange conclusions in the St Petersburg gamble, I don't think this would count all that much in its favour. As it seems plausible to me that various rules of decision theory that were developed in the cleaner cases of finite possibility spaces (or well-behaved infinite spaces) need to be tweaked to account for more pathological possibility spaces. For a simple example, I'm sympathetic to the sure thing principle, but it directly implies that the St Petersburg Gamble is better than itself, because an unresolved gamble is better than a resolved one, no matter how the latter was resolved. My guess is that this means the sure thing principle needs to have its scope limited to exclude gambles whose value is higher than that of any of their resolutions.
Thanks for the reference, seems better than my post and I hadn't seen it. (I think it's the version of the argument I allude to in this comment.)
Note that if you have unboundedly positive and unboundedly negative outcomes, then you must also violate the weaker version of the countable sure thing principle with weak inequality instead of strict inequality. Violating the weak version of the sure thing principle seems much worse to me. And I think most proponents of unbounded preferences would advocate for them to point in both directions, so they run into th... (read more)