You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

ChristianKl comments on AlphaGo versus Lee Sedol - Less Wrong Discussion

17 Post author: gjm 09 March 2016 12:22PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (183)

You are viewing a single comment's thread. Show more comments above.

Comment author: philh 15 March 2016 12:09:52PM 2 points [-]

Has anyone from Google commented much on AlphaGo's mistakes here? Why it made the mistake at 79, why it didn't notice until later that it was suddenly losing, and why it started playing so badly when it did notice.

(I've seen commentary from people who've played other monte-carlo based bots, but I'm curious whether Google has confirmed them.)

I don't think I've seen anyone say this explicitly: I would guess that part of the problem was AG hasn't had much training in "mistakes humans are likely to make". With good play, it could have recovered against Lee, but not against itself, and it didn't know it was playing Lee; somehow, the moves it actually played were ones that would have increased its chances of winning if it was playing itself.

Comment author: ChristianKl 15 March 2016 04:11:37PM 0 points [-]

I think the DeepMind folks said that they have to get back to London to analyse the case in detail.

somehow, the moves it actually played were ones that would have increased its chances of winning if it was playing itself.

I don't think that's a good explanation. There's no way that removing it's own ko threats with moves like P14 and O11 would have increased it's chances if it would have played against itself.

It look's a bit like belief propagation to update after missing an important move doesn't really work.