Jayson_Virissimo comments on Paper: Iterated Prisoner’s Dilemma contains strategies that dominate any evolutionary opponent - Less Wrong

27 Post author: mapnoterritory 02 June 2012 08:50PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (19)

You are viewing a single comment's thread. Show more comments above.

Comment author: Jayson_Virissimo 03 June 2012 01:17:57PM *  2 points [-]

I'm not really able to evaluate the claims in the paper myself, so thanks for the input. Having said that, do you think the paper specifies the strategies with enough detail for us to code them up and test their mettle in a Less Wrong IPD tournament?

Comment author: Kindly 03 June 2012 01:38:07PM *  7 points [-]

Here's the first strategy that is explicitly stated (in the context of the 5/3/1/0 payouts):

  • If the last outcome was CC, cooperate with probability 11/13.
  • If the last outcome was CD, cooperate with probability 1/2.
  • If the last outcome was DC, cooperate with probability 7/26.
  • If the last outcome was DD, defect.

This supposedly "enforces an extortion factor of 3", whatever that means.

Comment author: Andreas_Giger 03 June 2012 04:04:37PM *  4 points [-]

Sure, but this extortionate strategy wouldn't survive long in such a tournament because it would perform poorly against TFT variants as well as against itself.

Comment author: Randaly 25 June 2012 10:57:47PM 2 points [-]

I know that this isn't exactly what you're asking, but: Stewart and Plotkin tested two variants of ZD strategies in a variant of Axelrod's original tournament; one variant (ZDGTFT-2) had the highest total score, beating TFT.

Comment author: Kindly 03 June 2012 05:27:15PM *  1 point [-]

Here's another strategy computed from the paper: cooperate with probabilities (0.9, 0.7, 0.2, 0.1) in the case (CC, CD, DC, DD). Supposedly this strategy is one that sets the opponent's score to an average of 2, regardless of his or her actions. You could come up with a similar strategy to force any outcome in the interval (1,3), excluding the endpoints.

I've yet to check how this works.

Comment author: shokwave 04 June 2012 06:47:34AM 0 points [-]

I am now reading the paper with intent to code them into the current simulator. Wish me luck; reading Dyson is a challenge.