Vaniver comments on Measures, Risk, Death, and War - Less Wrong

11 Post author: Vaniver 20 December 2011 11:37PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (14)

You are viewing a single comment's thread. Show more comments above.

Comment author: Vaniver 21 December 2011 02:49:27AM 4 points [-]

I stopped at 45 rounds because I had hit a pretty 15-15-15. I think the most I was ahead was when it was about 10-7-6. I found that I could do well in the short run by swapping patterns- 'play what would have lost to what he played', and then when he picks up on that switching to what would beat him if he believes that about me. It then got harder / I stopped putting as much effort into it.

If you click on the "what he's thinking" thing, it looks like he just has a 3^8 lookup table based on the last four rounds. Given that game state, he throws against whatever the most likely human action was- which suggests it might be possible to infer that lookup table from his behavior then use it to find a stable loop you can mine (until you dominate that part of the lookup table). It would probably be unethical to write an AI to beat their AI, though, since that would be screwing with their data about humans.

Comment author: Mass_Driver 24 December 2011 06:22:12AM 3 points [-]

As a way to conserve effort, you can just never throw rock, and try to pick scissors/paper at random. This is sufficiently unusual behavior that the 3^8 lookup table should fail enough to give you a small but stable edge. I went 10-7-7 doing this.