Suppose you have two identical agents with shared finances, and three rooms A1, A2, B.
Flip a fair coin.
- If the coin comes up H, put the agents in A1, A2.
- If it comes up T, flip the coin again.
- If it comes up H, put the agents in A1, B.
- If it comes up T, put the agents in A2, B.
(At each point, flip another fair coin to decide the permutation, i.e. which agent goes to which room.)
Now to each agent in either A1 or A2, make the following offer:
Guess whether the first coin-flip came up heads or tails. If you correctly guess heads, you both get $1. If you correctly guess tails, you both get $3. No negative marking.
The agents are told which room they are in, and they know how the game works, but they are not told the results of any coin tosses, or where the other agent is, and they cannot communicate with the other agent.
...
In terms of resulting winning, if an agent chooses to precommit to always bet heads, its expected earnings are $1, but if it chooses to precommit to always bet tails, its expected earnings are $1.50. So it should bet tails, if it wants to win.
But consider what happens when the agent actually finds itself in A1 or A2 (which are the only cases it is allowed to bet): if it finds itself in A1, it disqualifies the TT scenario, and if it finds itself in A2, it disqualifies the TH scenario. In either case, the probability of heads goes up to 2/3. So then it expects betting heads to provide an expected return of $1.33, and betting tails to provide an expected return of $1. So it bets heads.
(There are no Sleeping Beauty problems here, the probability genuinely does go up to 2/3, because new information -- the label of the room -- is introduced. BTW, I later learned this is basically equivalent to the scenario in Conitzer 2017, except it avoids talking about memory wiping or splitting people in two or anything else like that.)
What's going on? Is this actually a way to beat superrational agents, or am I missing thing? Because clearly tails is the winning strategy, but heads is what EDT tells the agent to bet.
Oh right, I see where you're coming from. When I said "you can't control their vote" I was missing the point, because as far superrational agents are concerned, they do control each other's votes. And in that case, it sure seems like they'll go for the $2, earning less money overall.
It occurs to me that if team 4 didn't exist, but teams 1-3 were still equally likely, then "heads" actually would be the better option. If everyone guesses "heads," two teams are right, and they take home $4. If everyone guesses "tails," team 3 takes home $3 and that's it. On average, this maximizes winnings.
Except this isn't the same situation at all. With group 4 eliminated from the get go, the remaining teams can do even better than $4 or $3. Teammates in room A2 knows for a fact that the coin landed heads, and they automatically earn $1. Teammates in room A1 are no longer responsible for their teammates' decisions, so they go for the $3. Thus teams 1 and 2 both take home $1 while team 3 takes home $3, for a total of $5.
Maybe that's the difference. Even if you know for a fact that you aren't on team 4, you also aren't in a world where team 4 was eliminated from the start. The team still needs to factor into your calculations... somehow. Maybe it means your teammate isn't really making the same decision you are? But it's perfectly symmetrical information. Maybe you don't get to eliminate team 4 unless your teammate does? But the proof is right in front of you. Maybe the information isn't symmetrical because your teammate could be in room B?
I don't know. I feel like there's an answer in here somewhere, but I've spent several hours on this post and I have other things to do today.