MrMind comments on AlphaGo versus Lee Sedol - Less Wrong

17 Post author: gjm 09 March 2016 12:22PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (183)

You are viewing a single comment's thread. Show more comments above.

Comment author: MrMind 10 March 2016 08:28:12AM 0 points [-]

AlphaGo has convolutional neural network, supervised learning, self-generated supervised learning, a mix-up strategy between Monte Carlo rollouts and goal function estimation.
All these strategies are apted to go because it is a spatial game with a very well defined strategy function.
While I do see CNN and supervised learning well worth of being used for music, it is much more difficult to come up with something that resembles the third step in AlphaGo: generating millions of random 'games' (simphonies) with their own label (good music/bad music) to train an 'intuitive' network.

Comment author: gwern 10 March 2016 03:03:57PM 3 points [-]

While I do see CNN and supervised learning well worth of being used for music, it is much more difficult to come up with something that resembles the third step in AlphaGo: generating millions of random 'games' (simphonies) with their own label (good music/bad music) to train an 'intuitive' network.

Adversarial generative networks give you a good objective if you want to take a purely supervised approach.

Comment author: ChristianKl 10 March 2016 10:53:57AM 1 point [-]

generating millions of random 'games' (simphonies) with their own label (good music/bad music) to train an 'intuitive' network.

A spotify like service could be used to label the quality.

Alternatively it would also be nice to have music that's trained for specific goals like helping people concentrate while they work or reducing stress.