How Not to be Stupid: Adorable Maybes

Psy-Kosh

29th Apr 2009

Previous: Know What You Want

Ah wahned yah, ah wahned yah about the titles. </some enchanter named Tim>

(Oh, a note: the idea here is to establish general rules for what sorts of decisions one in principle ought to make, and how one in principle ought to know stuff, given that one wants to avoid Being Stupid in the sense described in earlier posts. So I'm throwing some general and contrived hypothetical situations at the system to try to break it, to see what properties it would need in order not to fail automatically.)

Okay, so assuming you buy the argument in favor of ranked preferences, let's see what else we can learn by considering sources of, ahem, randomness:

Suppose that, whether through indexical uncertainty, or genuine nondeterminism in the universe, or something else, there's some source of bits about which the only thing you're able to determine is that the ratio of 1s it puts out to total bits is p. You're not able to determine anything else about the pattern of bits; they seem unconnected to each other. In other words, you've got some source of uncertainty that leaves you knowing only that some outcomes happen more often than others, and potentially something about the precise relative rates of those outcomes.

I'm trying here to avoid actually assuming epistemic probabilities. (If I've slipped in an invisible assumption of that kind without noticing, let me know.) Instead I'm trying to construct a situation that can be accepted as at least validly describable by something resembling probabilities: propensities or frequencies. (Frequencies? Aieeee! Burn the heretic, or at least flame them without mercy! :)) So, for whatever reason, suppose the universe or your opponent or whatever has access to such a source of bits. Let's consider some of the implications of this.

For instance, suppose you prefer A > B.

Now, suppose you are somehow presented with the following choice: choose B outright, or choose a situation in which, at a specific instance, A occurs if the source outputs a 1 and B occurs otherwise. We'll call this sort of situation a p*A + (1-p)*B lottery, or simply p*A + (1-p)*B.
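To make the setup concrete, here is a minimal sketch of such a lottery. It is only an illustration: the names `bit_source`, `lottery`, and `run_lottery` are made up for this example, and a seeded pseudo-random generator stands in for the bit source, about which the post assumes nothing beyond the ratio p.

```python
import random


def bit_source(p, seed=0):
    """Yield bits forever; each bit is 1 with chance p.

    Assumption: a pseudo-random generator is used purely as a stand-in
    for the post's unspecified source of bits.
    """
    rng = random.Random(seed)
    while True:
        yield 1 if rng.random() < p else 0


def lottery(bit, win="A", lose="B"):
    """Resolve one p*A + (1-p)*B lottery from a single bit."""
    return win if bit == 1 else lose


def run_lottery(p, trials=10, seed=0):
    """Resolve the lottery `trials` times and return the list of outcomes."""
    source = bit_source(p, seed)
    return [lottery(next(source)) for _ in range(trials)]


if __name__ == "__main__":
    # Whatever p is, the only outcomes that can ever show up are A and B.
    print(run_lottery(0.3, trials=20))
```

The point of the sketch is only that the lottery's outcomes are always drawn from {A, B}; at p = 0 it collapses to "always B" and at p = 1 to "always A".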

So, which should you prefer: B or the above lottery? (Assume there's no cost beyond declaring your choice, or just wanting the choice. It's not a "pay for a lottery ticket" scenario yet, just an "assuming you simply choose one or the other... which do you choose?" scenario.)

Consider our holy law of "Don't Be Stupid", specifically in the manifestation of "Don't automatically lose when you could potentially do better without risking doing worse." It would seem the correct answer would be "choose the lottery, dangit!" The only possible outcomes of it are A or B. So it can't possibly be worse than B, since you actually prefer A. Further, choosing B is accepting an automatic loss compared to choosing the above lottery, which at least gives you a chance to do better. (Obviously we assume here that p is nonzero. In the degenerate case of p = 0, you'd presumably be indifferent between the lottery and B since, well... choosing that actually is the same thing as choosing B.)

By an exactly analogous argument, you should prefer A to the lottery. Specifically, A is an automatic WIN compared to the lottery, which doesn't give you any hope of doing better than A, but does give you a chance of doing worse. (Here we assume p is less than 1; in the degenerate case of p = 1, the lottery is the same thing as A.)

Example: Imagine you're dying horribly of some really nasty disease that you know isn't going to heal on its own, and you're offered a possible medication for it. Assume there's no other medication available, and assume that somehow you know as a fact that none of the ways it could fail could possibly be worse. Further, assume that you know as a fact no one else on the planet has this disease, and the medication is available for free to you and has already been prepared. (These last few assumptions are to remove any possible considerations like altruistically giving up your dose of the med to save another or similar.)

Do you choose to take the medication or not? Well, by assumption, the outcome can't possibly be worse than what the disease will do to you, and there's the possibility that it will cure you. Further, there are no other options available that may potentially be better than taking this med. (Oh, and assume that for whatever reason cryo is not available, so taking an ambulance ride to the future in hope of a better treatment is also not an option.) Basically, assume your choices are "die really really horribly" or "some chance of that, and some chance of making a full recovery. No chance of partially surviving in a state worse than death."

So the obviously obvious choice is "choose to take the medication."

Next time: We actually do a bit more math based on what we've got so far and begin to actually construct utilities.

55 Comments

cousin_it, 17y

What I'm going for here is more "why assume that Bayesian decision theory is the thing we should be building approximations to, rather than some other entirely different blob of math?"

Over the last couple years I went from believing that statement to deeply doubting it. If you want a chess player that will win games by holding the opponents' kids hostage, sure, build a Bayesian optimizer. My personal feeling is that even an ordinary human modified to be deeply and genuinely driven by an explicit utility function would pose a substantial danger to this world. No need for AIs.

Vladimir_Nesov, 17y

That is a right sentiment about strength: there are no simple rules, only goals, which makes a creative mind extremely dangerous. And we shouldn't build things like this without understanding what the outcome will be. This is one of the reasons it's important to understand human values in this light, to guard them from this destructive potential.

Whatever you want accomplished, whatever you want averted, instrumental rationality defines an optimal way of doing that (without necessarily giving the real-world means, that's a next step). If you really want lif... (read more)

Psy-Kosh, 17y

That's where the whole "don't assume an overly simplistic preference ranking for yourself" warnings come in. I.e., there's nothing wrong with the utility function being composed of terms for all the things we value, and simply happening to include for that player a component that translates to "win at chess by actually playing chess", and other components that lower utility for "kids have been kidnapped" situations, etc. The hard part is, of course, actually translating the algorithms we're running (including the bits that respond to arguments that lead us to become convinced to change our minds about a moral question, etc.) into a more explicit algorithm. Any simple one is going to get it WRONG. But that's not a hit against decision theory. That's a hit against bad utility functions. Or did I utterly misunderstand your point?
