huh comments on Pascal's Muggle: Infinitesimal Priors and Strong Evidence - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (404)
Isn't this more of social recognition of a scam?
While there are decision-theoretic issues with the Original Pascal's Wager, one of the main problems is that it is a scam ("You can't afford not to do it! It's an offer you can't refuse!"). It seems to me that you can construct plenty of arguments like you just did, and many people wouldn't take you up on the offer because they'd recognize it as a scam. Once something has a high chance of being a scam (like taking the form of Pascal's Wager), it won't get much more of your attention until you lower the likelihood that it's a scam. Is that a weird form of Confirmation Bias?
But nonetheless, couldn't the AI just function in the same way as that? I would think it would need to learn how to identify what is a trick and what isn't a trick. I would just try to think of it as a Bad Guy AI who is trying to manipulate the decision making algorithms of the Good Guy AI.
I also think that the variant of the problem featuring an actual mugger is about scam recognition.
Suppose you get an unsolicited email claiming that a Nigerian prince wants to send you a Very Large Reward worth $Y. All you have to do is send him a cash advance of $5 first ...
I analyze this as a straightforward two-player game tree via the usual minimax procedure. Player one goes first, and can either pay $5 or not. If player one chooses to pay, then player two goes second, and can either pay Very Large Reward $Y to player one, or he can run away with the cash in hand. Under the usual minimax assumptions, player 2 is obviously not going to pay out! Crucially, this analysis does not depend on the value for Y.
The analysis for Pascal's mugger is equivalent. A decision procedure that needs to introduce ad hoc corrective factors based on the value of Y seems flawed to me. This type of situation should not require an unusual degree of mathematical sophistication to analyze.
When I list out the most relevant facts about this scenario, they include the following: (1) we received an unsolicited offer (2) from an unknown party from whom we won't be able to seek redress if anything goes wrong (3) who can take our money and run without giving us anything verifiable in return.
That's all we need to know. The value of Y doesn't matter. If the mugger performs a cool and impressive magic trick we may want to tip him for his skillful street performance. We still shouldn't expect him to payout Y.
I generally learn a lot from the posts here, but in this case I think the reasoning in the post confuses rather than enlightens. When I look back on my own life experiences, there are certainly times when I got scammed. I understand that some in the Less Wrong community may also have fallen victim to scams or fraud in the past. I expect that many of us will likely be subject to disingenuous offers by unFriendly parties in the future. I respectfully suggest that knowing about common scams is a helpful part of a rationalist's training. It may offer a large benefit relative to other investments.
If my analysis is flawed and/or I've missed the point of the exercise, I would appreciate learning why. Thanks!
When you say that player 2 "is obviously not going to pay out" that's an approximation. You don't know that he's not going to pay off. You know that he's very, very, very, unlikely to pay off. (For instance, there's a very slim chance that he subscribes to a kind of honesty which leads him to do things he says he'll do, and therefore doesn't follow minimax.) But in Pascal's Mugging, "very, very, very, unlikely" works differently from "no chance at all".