pallas comments on Chocolate Ice Cream After All? - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (77)
I agree with your objection, but I think there is a way to fix this problem into a properly Newcomb-like form:
Genes GA and GB control your lifespan, but they don't directly affect your decision making processes, and cause no other observable effect.
Omega predicts in advance, maybe even before you are born, whether you will press the A button or the B button, and modifies you DNA to include gene GA if he predicts that you will press A, or GB if he predicts that you will press B.
This creates a positive correlation between GA and A, and between GB and B, without any direct causal relation.
In this scenario, evidential decision theory generally ( * ) chooses A.
The same fix can be applied to the Solomon's problem, Newcomb's soda, and similar problems. In all these cases, EDT chooses the "one-box" option, which is the correct answer by the same reasoning which can be used to show that one-boxing is the correct choice ( * ) for the standard Newcomb's problem.
I think that allowing the hidden variable to directly affect the agent's decision making process, as the original version of the problems above do, makes them ill-specified:
If the agent uses EDT, then the only way a hidden variable can affect their decision making process is by affecting their preferences, but then the "tickle defence" becomes valid: since the agent can observe their preferences, this screens off the hidden variable from any evidence gathered by performing an action, therefore EDT chooses the "two-box" option.
If the hidden variable, other other hand, affects whether the agent uses EDT or some corrupted version of it, then, the question "what should EDT do?" becomes potentially ill-posed: EDT presumes unbounded rationality, does that mean that EDT knows that it is EDT, or is it allowed to have uncertainty about it? I don't know the answer, but I smell a self-referential paradox lurking there.
( *
One-boxing is trivially the optimal solution to Newcomb's problem if Omega has perfect prediction accuracy.
However, if Omega's probability of error is epsilon > 0, even for a very small epsilon, telling the optimal solution isn't so trivial: it depends on the stochastic independence assumptions you make. )
Another possibility is that Omega presents the choice to very few people with the G_B gene in the first place; only to the ones who for some reason happen to choose B as Omega predicts. Likewise, Omega can't present the choice to every G_A carrier because some may select B.