Papers framing anthropic questions as decision problems?

jsalvatier

A few weeks ago at a Seattle LW meetup, we were discussing the Sleeping Beauty problem and the Doomsday argument. We talked about how framing Sleeping Beauty problem as a decision problem basically solves it and then got the idea of using same heuristic on the Doomsday problem. I think you would need to specify more about the Doomsday setup than is usually done to do this.

We didn't spend a lot of time on it, but it got me thinking: Are there papers on trying to gain insight into the Doomsday problem and other anthropic reasoning problems by framing them as decision problems? I'm surprised I haven't seen this approach talked about here before. The idea seems relatively simple, so perhaps there is some major problem that I'm not seeing.

I don't think so. But I can give you the critique here :)

SUMMARY: If you don't actually calculate expected utility, don't expect to automatically make choices that correspond to a relevant utility function. Also, don't name a specific function for a general feeling - you might accidentally start calling the function when you just mean the feeling, and then you get really wrong answers.

I've already made the obvious criticism - since Stuart's anthropic decision theory is basically a way to avoid using subjective probabilities (also known as "probabilities") when they're confusing, it makes sense that it usually can't do probability-associated things like maximizing the expectation of a specific utility function.

The replacement for actually figuring out the probabilities is an "anthropic preference" that takes a list of utilities of different people you could be inside a "world," then outputs some effective utility for that world. That is, it's some function A(U(1), U(2), U(3), ... ), where U is the utility of being a certain person. The "worlds" are the different outcomes that you would know the probabilities for if no confusing anthropic things happened (for example, in a coin flip, the probabilities of the heads and tails worlds are 0.5). So the expected utility-ish-stuff is the sum over the worlds of P(world, if there was no anthropic stuff) * A(U(1), U(2), U(3), ... ).

Expected utility, on the other hand, can be written as the sum over people you could be of P(being this person) U(being this person). So the only time when this anthropic decision theory actually maximizes utility is when its expected utility-ish-stuff is proportional to the actual expected utility - that is, you have to pick the correct anthropic preference out of all possible functions, and it can be a different* one for every different problem. The easiest way to do this is simply to know what the expected utility is beforehand, and in that case you might as well just maximize that :P

On the other hand, it's not impossible to find a few useful "A"s, in the occasion that things are really simple. For example, if the different probabilities of being people are all a constant, multiplied by P(world, if there was no anthropic stuff), then the probability of being in a "world" is proportional to the number of people in that world, and the expected utility of being in a world is just the unweighted average of the utility. So the anthropic preference "A" just has to add everything together.

The Sleeping Beauty problem is an example that's simple enough to meet these conditions. There are a few other simple situations that can also yield simple "A" functions. However, if you drift away from simplicity, for example by giving people weak evidence about which world they're in, you can't even always find an "A" that gives you back the expected utility.

Another way to get into trouble is if you use an "A" that you think corresponds to a specific expected utilt, but you have not ever proved this. This was my real beef with Stuart's paper - he names his anthropic preferences with colorful descriptors like "altruistic" or "selfish." This is fine as long as you keep in mind that these names correspond to the anthropic preferences, not to any sort of utility function; but it leads to folly when he goes on to say things like "if we have an altruistic agent," as if he could tell how the agent acted in general. Because the "A" that corresponds to a utility function can change from problem to problem, calling an "A" function "selfish" can't somehow force it to correspond to a utility function we'd call selfish.

So in retrospect calling these anthropic preferences things like "altruistic" was a pretty bad idea, because it generated confusion. This confusion results in him whiffing the Presumptuous Philosopher problem, calling on what "selfish" versus "altruistic" methods of adding utilities together would say, with no guarantee that they have anything to do with any relevant expected utility. In fact, at no point does he ever compare anything to expected utility, because, if you remember back 20 paragraphs ago ( :D ), the whole point of this decision process was to avoid confusing probabilities.

4Stuart_Armstrong14y

Many thanks for your comments; it's nice to have someone engaging with it. That said, I have to disagree with you! You're right, the whole point was to avoid using "anthropic probabilities" (though not "subjective probabilities" in general; I may have misused the word "objective" in that context). But the terms "altruistic", "selfish" and so on, do correspond to actual utility functions. "Selfless" means your utility function has no hedonistic content, just an arbitrary utility function over world states that doesn't care about your own identity. "Altruistic" means your utility function is composed of some amalgam of the hedonistic utilities of you and others. And "selfish" means your utility function is equal to your own hedonistic utility. The full picture is more complex - as always - and in some contexts it would be best to say "non-indexical" for selfless and "indexical" for selfish. But be as it may, these are honest, actual utility functions that you are trying to maximise, not "A"'s over utility functions. Some might be "A"'s over hedonistic utility functions, but they still are genuine utilities: I am only altruistic if I actually want other people to be happy (or achieve their goals or similar); their happiness is a term in my utility function. Then ADT can be described (for the non-selfish agents) coloquially as "before the universe was created, if i wanted to maximise U, what decision theory would I want any U-maximiser to follow? (ie what decision theory maximises the expected U in this context)". So ADT is doubly a utility maximising theory: first pick the utility maximising decision theory, then the agents with it will try and maximise utility in accordance with the theory they have. (for selfish/indexical agents it's a bit more tricky and I have to use symmetry or "veil of ignorance" arguments; we can get back to that). Furthermore, ADT can perfectly deal with any other type of uncertainty - such as whether you are or aren't in a Sleeping Beaut

6

Papers framing anthropic questions as decision problems?

6

6

6

Papers framing anthropic questions as decision problems?

6

6