Open Thread, February 15-29, 2012

OpenThreadGuy

If it's worth saying, but not worth its own post (even in Discussion), then it goes here.

But even if we're just worried about mugging, how could you possibly weight it enough? Even if paying once doomed me to spend the rest of my life paying $5 to muggers, the utility calculation still works out the same way.

My idea is as follows:

Mugger: Give me 5 dollars, or I'll torture 3^^^3 sentient people across the omniverse using my undetectable magical powers.
AI: If I make my decision on this and similar trades based on a decision process DP0 of comparing the disutility(3^^^3 torture) P(you're telling the truth) compared to the disutility(giving you 5 dollars), then even if you're telling the truth, a different malicious agent may then merely name a threat that involves 3^^^^3 tortures, and thus make me cause a vastly great amount of disutility in his service. Indeed there's no upper bound to the disutility such a hypothetical agent may claim will cause, and therefore surrendering to such demands mean a likewise unbounded exploitation potential. Therefore I will not* use the decision process DP0, and will instead utilize some different decision process (like "Never surrender to blackmail" or "Always demand proportional evidence before considering sufficiently extraordinary claims").

Give me $5 or I will torture 3^^^^3 sentient people across the omniverse for 1,000 years each and then kill them. using my undetectable magical powers. You can pay me by paypal to mwengler@gmail.com. Unless 20 people respond (or the integrated total I receive reaches $100) then I will carry out the torture.

Now you may think I am making the above statement to make a point. Indeed it seems probable, but what if I am not? How do you weigh the very finite probability that I mean it against 3^^^^3 sentient lives

I feel confident that the amount of money I r... (read more)

2endoself14y

Saving 3^^^^3 people is more than worth a bit of vulnerability to blackmail. If 3^^^^3 people are in danger, the AI wishes to believe 3^^^^3 people are in danger and in that case "never surrender to blackmail" is a strictly worse strategy. Also, DP0 isn't even a coherent decision process. The expected utilies will fail to converge if "there's no upper bound to the disutility such a hypothetical agent may claim" and these claims are interpreted with some standard assumptions, so the agent has no way of even comparing expected utilities of actions.

0thomblake14y

So the AI is not an expected utility maximizer? If it is not, then what is it? If it is, then what calculations did it use to reach the above decision - what were the assigned probabilities to the scenarios mentioned?

6

Open Thread, February 15-29, 2012

6

6

6

Open Thread, February 15-29, 2012

6

6