ChristianKl comments on AlphaGo versus Lee Sedol - Less Wrong Discussion
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (183)
Has anyone from Google commented much on AlphaGo's mistakes here? Why it made the mistake at 79, why it didn't notice until later that it was suddenly losing, and why it started playing so badly when it did notice.
(I've seen commentary from people who've played other monte-carlo based bots, but I'm curious whether Google has confirmed them.)
I don't think I've seen anyone say this explicitly: I would guess that part of the problem was AG hasn't had much training in "mistakes humans are likely to make". With good play, it could have recovered against Lee, but not against itself, and it didn't know it was playing Lee; somehow, the moves it actually played were ones that would have increased its chances of winning if it was playing itself.
I think the DeepMind folks said that they have to get back to London to analyse the case in detail.
I don't think that's a good explanation. There's no way that removing it's own ko threats with moves like P14 and O11 would have increased it's chances if it would have played against itself.
It look's a bit like belief propagation to update after missing an important move doesn't really work.