Unnamed comments on Random thought: What is the optimal PD strategy under imperfect information? - Less Wrong Discussion
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (5)
Wikipedia actually has a pretty good (though brief) discussion of this. They mention three alternative strategies that could deal with noise: tit for two tats (only defect after your opponent defects twice), tit for tat with forgiveness (randomly cooperate after an opponent's defection with probability p), and contrite tit for tat (cooperate extra after you accidentally defect). The article that they cite for contrite tit for tat, Boyd (1989), looks promising. And googling contrite tit for tat turned up a relevant paper by Wu & Axelrod (1995) (pdf):
Boyd, R. (1989). Mistakes Allow Evolutionary Stability in the Repeated Prisoner's Dilemma Game. Journal of Theoretical Biology, 136 (1): 47-56.
Wu, J. & Axelrod, R. (1995). How to cope with noise in the iterated prisoner's dilemma. Journal of Conflict Resolution, 39, 183-189.
If I remember correctly, it matters a lot exactly what the noise parameter is. As soon as things get noisy enough, Grim (start off cooperating, then defect if the opponent has ever defected) starts to dominate all of the clever Tit for Tat variants. Obviously, if you make things noisy enough, then Always Defect becomes the best strategy, but Grim does well long before that.
We had an IPD tournament with noise at our university recently, and I entered a variant of Downing (essentially, model your opponent as some sort of Markovian process) which won quite convincingly (mostly because it could exploit Always Cooperate, which was in the initial pool of strategies, better than the TfT variants).