You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

bogus comments on [Link] AlphaGo: Mastering the ancient game of Go with Machine Learning - Less Wrong Discussion

14 Post author: ESRogs 27 January 2016 09:04PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (122)

You are viewing a single comment's thread. Show more comments above.

Comment author: jacob_cannell 29 January 2016 05:45:18PM *  3 points [-]

In the brain, the same circuitry that is used to solve vision is used to solve most of the rest of cognition - vision is 10% of the cortex. Going from superhuman vision to superhuman Go suggests superhuman anything/everything is getting near.

The reason being that strong Go requires both deep slow inference over huge data/time (which DL excels in, similar to what the cortex/cerebellum specialize in), combined with fast/low data inference (the MCTS part here). There is still much room for improvement in generalizing beyond current MCTS techniques, and better integration into larger scale ANNs, but that is increasingly looking straightforward.

It's tempting to assume that the "keystone, foundational aspect" of intelligence is learning essentially the same way that artificial neural networks learn.

Yes, but only because "ANN" is enormously broad (tensor/linear algebra program space), and basically includes all possible routes to AGI (all possible approximations of bayesian inference).

But humans can do things like "one-shot" learning, learning from weak supervision, learning in non-stationary environments, etc. which no current neural network can do, and not just because a matter of scale or architectural "details".

Bayesian methods excel at one shot learning, and are steadily integrating themselves into ANN techniques (providing the foundation needed to derive new learning and inference rules). Progress in transfer and semi-supervised learning is also progressing rapidly and the theory is all there. I don't know about non-stationary as much, but I'd be pretty surprised if there wasn't progress there as well.

Thus I think it's fair to say that we still don't know what the foundational aspects of intelligence are.

LOL. Generalized DL + MCTS is - rather obviously - a practical approximation of universal intelligence like AIXI. I doubt MCTS scales to all domains well enough, but the obvious next step is for DL to eat MCTS techniques (so that super new complex heuristic search techniques can be learned automatically).

Comment author: bogus 29 January 2016 06:01:49PM *  0 points [-]

Yes, but only because "ANN" is enormously broad (tensor/linear algebra program space), and basically includes all possible routes to AGI (all possible approximations of bayesian inference).

"Enormously broad" is just another way of saying "not very useful". We don't even know in which sense (if any) the "deep networks" that are used in practice may be said to approximate Bayesian inference; the best we can do, AIUI, is make up a hand-wavy story about how they must be some "hierarchical" variation of single-layer networks, i.e. generalized linear models.

Comment author: jacob_cannell 29 January 2016 09:58:41PM 1 point [-]

Specifically I meant approx bayesian inference over the tensor program space to learn the ANN, not that the ANN itself needs to implement bayesian inference (although they will naturally tend to learn that, as we see in all the evidence for various bayesian ops in the brain) .