jacob_cannell comments on Rationality is about pattern recognition, not reasoning - Less Wrong

25 Post author: JonahSinick 26 May 2015 07:23PM

Comment author: Wei_Dai 28 May 2015 11:01:48PM 6 points

Interesting. I found one paper that explains the one learning algorithm hypothesis and gives evidence for it. Quoting from it:

There seems to be some evidence for a single algorithm explaining the computations performed by the primary auditory, visual, motor, and somatosensory cortices. Given how little is known about higher-level processing immediately downstream of these primary areas, it is premature to generalize to other areas located in the occipital lobe. Less is known about the details of the computations performed in the areas located in anterior cortex and, in particular, the prefrontal cortex, which is disproportionately enlarged in humans when compared to non-human primates.

Is there anything more up to date or comprehensive than this paper?

This tangent aside, I agree that it would be really valuable to improve the way we process evidence subconsciously. I'm a bit skeptical that you've actually found such a method, but I hope that you succeed in writing it down and that it really works.

Comment author: jacob_cannell 13 June 2015 09:26:40PM * 1 point

The paper you linked to about the one learning algorithm hypothesis is from 2012. Since then, the theory has gained significant support from advances in deep learning (DL), and in particular from the work on deep reinforcement learning (RL). Showing that an ANN with a relatively simple initial/prior architecture and about 1 million neurons can reach human-level performance on a set of 100 games when trained end to end with RL is pretty strong (albeit indirect) evidence for the one learning algorithm hypothesis.

One key remaining question is then: how does the brain actually implement approximate optimization/learning that is at least as good as backprop? We know that backprop itself is not biologically realistic. On that front, Bengio's group has made significant recent progress with a new technique/theory called target propagation [1], which originated in part as an explanation for how the brain could implement credit assignment, but which also shows promise as a potential replacement for backprop [2], which further increases the biological plausibility.
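
To make the credit-assignment idea a bit more concrete, here is a minimal sketch of the target-passing step in the "difference" variant of target propagation, as I understand it from the published descriptions. Everything in it is an assumption for illustration: the function names (`forward`, `approx_inverse`, `propagate_target`), the toy tanh layers, and the hand-picked weights do not come from the linked papers. Instead of sending a gradient down, the upper layer sends a target activation down through an approximate inverse, and each layer would then be trained locally to move its output toward its own target.

```python
import math

def matvec(M, v):
    """Plain matrix-vector product on nested lists."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def forward(h_prev, W):
    """Toy layer (illustrative): h = tanh(W h_prev)."""
    return [math.tanh(z) for z in matvec(W, h_prev)]

def approx_inverse(h, V):
    """Hypothetical approximate inverse g of the layer, here another toy layer."""
    return [math.tanh(z) for z in matvec(V, h)]

def propagate_target(h_prev, h, h_target, V):
    """Difference-corrected target for the lower layer.

    h_prev_target = h_prev + g(h_target) - g(h)
    If the upper layer already sits at its target, the lower target is
    just h_prev, even when g is an imperfect inverse.
    """
    g_target = approx_inverse(h_target, V)
    g_actual = approx_inverse(h, V)
    return [p + gt - ga for p, gt, ga in zip(h_prev, g_target, g_actual)]

# Hand-picked illustrative weights, not trained values.
W = [[0.5, -0.2], [0.1, 0.3]]   # forward weights
V = [[0.4, 0.1], [-0.3, 0.6]]   # approximate-inverse weights

h_prev = [1.0, -0.5]
h = forward(h_prev, W)

# Suppose the layer above asks for a slightly different activation.
h_target = [h[0] + 0.05, h[1] - 0.05]
h_prev_target = propagate_target(h_prev, h, h_target, V)

# Local objective this layer would minimize: ||forward(h_prev) - h_target||^2
local_loss = sum((a - t) ** 2 for a, t in zip(h, h_target))
```

The sketch leaves out training of both the forward and the inverse weights; it only shows how a target, rather than a gradient, is handed from one layer to the one below.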

In terms of more direct evidence, the hippocampus in particular appears to have a simple explanation in terms of reinforcement learning [3].

As for the prefrontal cortex, there are working theories that explain much of the PFC as a set of modules specialized as working memory buffers, which are controlled by gating units in the basal ganglia. That whole system is also driven/learned through dopamine-based RL.
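
As a toy illustration of the gating picture (not a model taken from the literature), the sketch below shows a buffer that only changes its contents when a gate opens, plus a prediction-error update of the kind usually associated with dopamine-based RL. The class name `GatedBuffer`, the function `update_gate_value`, and the learning rate are all assumptions made up for this example.

```python
class GatedBuffer:
    """Working-memory slot whose contents change only when the gate is open."""

    def __init__(self):
        self.content = None

    def step(self, stimulus, gate_open):
        # Open gate: overwrite the buffer. Closed gate: maintain old contents.
        if gate_open:
            self.content = stimulus
        return self.content


def update_gate_value(value, reward, lr=0.1):
    """Nudge the estimated value of opening the gate toward the observed reward.

    delta plays the role of a reward prediction error.
    """
    delta = reward - value
    return value + lr * delta


# Store a cue, ignore a distractor, then read the cue back out.
buf = GatedBuffer()
buf.step("cue", gate_open=True)
buf.step("distractor", gate_open=False)
recalled = buf.content

# Opening the gate on the cue led to reward, so its estimated value goes up.
cue_gate_value = update_gate_value(0.0, reward=1.0)
```

In a fuller model the gate decision itself would be sampled from the learned values, so that gating policies which lead to reward get reinforced over time.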