MrMind comments on AlphaGo versus Lee Sedol - Less Wrong

17 Post author: gjm 09 March 2016 12:22PM

Comments (183)

Comment author: HungryHobo 09 March 2016 06:07:16PM 4 points

I'm quite interested in how many of the methods employed in this AI can be applied to more general strategic problems.

A friend who did quite a bit of work in machine composition was of the opinion that tools for handling strategy tasks like Go would also apply strongly to many design tasks, such as composing good music.

Comment author: MrMind 10 March 2016 08:28:12AM 0 points

AlphaGo uses convolutional neural networks, supervised learning, self-generated supervised learning, and a mixed strategy that combines Monte Carlo rollouts with goal (value) function estimation.
All these strategies are well suited to Go because it is a spatial game with a very well-defined strategy function.
While I do see CNNs and supervised learning as well worth using for music, it is much more difficult to come up with something that resembles the third step in AlphaGo: generating millions of random 'games' (symphonies), each with its own label (good music/bad music), to train an 'intuitive' network.
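
To make the mix of Monte Carlo rollouts and function estimation concrete, here is a minimal Python sketch. As far as I understand the published AlphaGo description, a leaf position is scored as a weighted blend of the value network's estimate and the outcome of a fast rollout, with a mixing weight of about 0.5. Everything else here is a hypothetical stand-in for illustration: the names `mixed_leaf_value`, `toy_value` and `toy_rollout`, and the toy "state" that is just a win probability rather than a board.

```python
import random


def mixed_leaf_value(state, value_fn, rollout_fn, lam=0.5, n_rollouts=1, rng=None):
    """Blend a learned value estimate with Monte Carlo rollout outcomes.

    V(state) = (1 - lam) * value_fn(state) + lam * mean(rollout outcomes)
    """
    rng = rng or random.Random()
    # Average the results of n_rollouts simulated playouts (+1 win, -1 loss).
    z = sum(rollout_fn(state, rng) for _ in range(n_rollouts)) / n_rollouts
    return (1.0 - lam) * value_fn(state) + lam * z


# Toy stand-ins (assumptions): the "state" is the true win probability.
def toy_value(state):
    """Pretend value network: expected outcome in [-1, 1]."""
    return 2.0 * state - 1.0


def toy_rollout(state, rng):
    """Pretend fast rollout: one noisy win/loss sample."""
    return 1.0 if rng.random() < state else -1.0


if __name__ == "__main__":
    rng = random.Random(0)
    estimate = mixed_leaf_value(0.75, toy_value, toy_rollout,
                                lam=0.5, n_rollouts=100, rng=rng)
    print("blended estimate for a 75% winning position:", estimate)
```

The point for music is that both halves of the blend need a clear win/loss signal at the end of a 'game', and that signal is exactly what is hard to define for a symphony.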

Comment author: gwern 10 March 2016 03:03:57PM 3 points

While I do see CNNs and supervised learning as well worth using for music, it is much more difficult to come up with something that resembles the third step in AlphaGo: generating millions of random 'games' (symphonies), each with its own label (good music/bad music), to train an 'intuitive' network.

Adversarial generative networks give you a good objective if you want to take a purely supervised approach.

Comment author: ChristianKl 10 March 2016 10:53:57AM 1 point

generating millions of random 'games' (symphonies), each with its own label (good music/bad music), to train an 'intuitive' network.

A Spotify-like service could be used to label the quality.

Alternatively, it would also be nice to have music that's trained for specific goals, like helping people concentrate while they work or reducing stress.