gjm comments on Innate Mathematical Ability - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (140)
This seems like an odd way of phrasing things, and the oddity may go deeper. As I understand it, Tao's verbal scores were still really good for an 8-year-old. So it's not like they indicate an actual mental deficit; it's just that he was really inhumanly good at mathematical reasoning versus only really good at verbal. Given that, I don't see why we should expect a "still lower ability" anywhere (I mean, beyond the trivial observation that min < average; I take it you are suggesting something more dramatic than that).
My impression from reading TT's blog is that he has rather a lot of useful things to say about thinking techniques; see e.g. lots of the links from here. He doesn't strike me as someone with "relatively low metacognition" unless you mean "low relative to his skill as a mathematician" (in which case: well, yes, but I don't think that's an interesting observation).
I have the impression that you've seized on the idea that "mathematicians tend to have poor metacognition" (on, I uncharitably suspect, weak evidence) and are trying to apply it even where it doesn't fit well. It's possible that TT has made an error here -- that exceptional IQ-style intelligence is more important to mathematical success than he thinks it is -- but I submit that "poor metacognition" is probably not the best analysis of why, if so.
Your paragraph stops here in mid-sentence.
This seems like pure baseless speculation to me.
I think the analogy is flawed in an important way. In our society (at least the bits of it that TT inhabits), intelligence is highly valued and respected; good looks less so. "You were only so successful because you're very good-looking" is an insult; "You were only so successful because you're incredibly intelligent" is a compliment. So if NP is reluctant to accept the former and TT is reluctant to accept the latter, it seems likely that their reasons and motives differ.
I find your comment helpful insofar as it points to ways in which my article might be misunderstood, but it would be more productive to be inquisitive.
No, I think that his verbal abilities are significantly above average relative to the general population, but perhaps only average relative to mathematicians as a group.
Do you know principal component analysis?
My point was that the SAT verbal is that surely partly a test of abstract reasoning ability of the type picked up on by Raven's Matrices, while partly being a test of a second thing, so that performance on the SAT verbal is determined by a weighted average of these two things, and that since Tao is really high on abstract reasoning ability, he must be lower on that second thing than his score would suggest if taken in isolation.
Yes, these things are all relative. I added an edit to my post to clarify. For the most part, I find Tao's comments on thinking techniques and his advice sound. But there are other elite mathematicians whose understanding runs much deeper, and this is in fact highly significant, just as it's highly significant that Tao was able to score 760 on the math SAT at age 8 rather than at age 13.
Do you have an alternative explanation to the two that I proposed? Surely you'll concede that there's something a priori very bizarre about the situation: Scott Alexander, who got a C- in calculus, is able to recognize a simple quantitative argument that one of the best mathematicians missed, despite the fact that Tao is much closer to the situation that Scott is analyzing than Scott is.
I agree that there's some asymmetry, but I don't think that it's relevant. The point that I was getting at is more subtle. It's clearly not true that Tao and Portman were only successful because of their intelligence and looks respectively. I think that a careful reading of my paragraph will make my meaning clear, but if not, I can try to clarify.
Seems plausible -- though for what it's worth I'd rate his verbal abilities substantially above those of mathematicians generally.
That would be what I described as "the trivial observation that min < average" :-) and sure, I agree that whatever feature of Tao's verbal intelligence is worst has to be worse than his overall verbal intelligence, but I don't see why that's interesting enough to be worth drawing attention to. I guess your point is that if his general intelligence is so spectacularly high then to average out correctly some aspect of his verbal intelligence must be quite a lot lower than his overall verbal -- but it seems equally plausible to me that verbal SAT results just don't depend all that strongly on the kind of pattern-spotting tested by the really hard Raven matrices.
I can think of several. He may have too low an opinion of his own intelligence because of the sort of weird psychological hangups that many very clever people have. He may interpret "intelligence" in a way that weights more-mathematical things less heavily (perhaps because, being so exceptional in the latter, he sees more clearly the distinction between those and other sorts of thinking). He may be a victim of Political Correctness Gone Mad and feel that he has to play down the importance of intelligence. His idea of what constitutes exceptionally high intelligence may be skewed by the fact that he is surrounded by super-smart people. He may have spent less time thinking about intelligence than Scott has (intelligence being something of a preoccupation in the rationalist community, and I suspect less so in Tao's circles).
But I can't quite agree with your framing of the question: that is, I am not convinced that he has missed the argument Scott describes. Scott's argument just says: one person who's incredibly good at mathematics and incredibly good at Raven's matrices is evidence that being exceptionally good at Raven's matrices is important for being incredibly good at mathematics. But one can accept that but still find that, overall, the evidence suggests that the connection (beyond a certain level of Raven's-matrices skill) isn't all that strong. (For instance, knowing a few people who are first-rate research mathematicians and not particularly good at super-hard Raven problems is pretty good evidence the other way.) And it seems to me, taking the outside view, that if I have to guess whether Terry Tao or Jonah Sinick has seen more evidence about what makes really exceptional professional mathematicians then I'm going to go with Terry Tao.
(My understanding is that you have spent a lot of time working with extremely able school pupils on Olympiad-ish mathematics -- my apologies if I've got the wrong end of the stick -- and my own experience suggests that Olympiad problems are substantially closer to pure g-tests than doing actual mathematical research. Have you considered the possibility that your estimate of the relationship between IQ-style intelligence and success in pure mathematical research may be biased by this?)
I agree, there's still some effect though.
The things that you list seem to me closely related to my second suggestion under "Is this all depressing?", e.g. I think that one factor that plays into "the political correctness gone mad" on this point is people want to believe that life is more fair than it actually is (for reasons overlapping somewhat with the reasons for the just-world fallacy).
I would agree, if not for the fact that I'm drawing on many sources (as I described in the introduction of my last post). Some mathematicians more successful than Tao hold a contrary position.
Your interpretation is very understandable. I wrote a blog post back in October 2010 implicitly expressing a position similar to your own.
What started to change my thinking was point (3) of Carl's Shulman's response to my post. At the time, I was unaware of the phenomenon that he described: that performance on one task is often highly predictive of performance on an apparently unrelated task.
Using a simple machine learning model, I found that amongst International Mathematics Olympiad contestants, those who went on to earn Fields medals and similar prizes had ~5x as great a priori odds relative to the average contestant, based on their IMO scores alone. The effect becomes even more pronounced when one weights prize winners by the significance of their work: for example, Perelman was one of only three perfect scorers in 1982. It doesn't necessarily agree with the inside view intuition that I've formed talking with lots of mathematicians, but the existence of a robust effect is unambiguous. I'll make a post going into detail later.
There aren't a lot of mathematicians more successful than Tao. I suppose he hasn't won the Abel Prize yet. (Looking at the list of winners, it looks as if that one tends to go to older mathematicians in recognition of their lifetime's great achievements. The youngest winner was Gromov: born 1943, Abel Prize in 2009.) Could tyou name some of the mathematicians you have in mind (and, even better, point us at what they've said on the subject)?
I was referring to successful research as opposed to success at winning prizes.
The connection between prizes and quality of research comes apart for a variety of reasons: arbitrary age restrictions (in both directions), ceiling effects (many prizes are awarded once a year independently of the quality of research of potential prize recipients), individual idiosyncrasies of the people on the committees that award prizes, etc.
One mathematician more accomplished than Tao is Robert Langlands, known for the so-called "Langlands Program."
In this essay, Langlands wrote:
I know this is only a single example – it's hard to find examples of mathematicians writing about the nature of mathematical talent in the public domain altogether. But I'll try to provide more later.
I wasn't meaning to imply that you define success in terms of prizes (and, for that matter, neither do I). I agree that Langlands is a more important mathematician than Tao. But that's a hell of a bar to clear. (Also, speaking of age effects, I remark that if you define mathematical success in terms of what one has achieved to date and its demonstrated influence in mathematics generally, you're inevitably going to prefer older mathematicians -- Langlands is 78 to Tao's 40ish -- and that's going to affect what biases they have affecting their ideas about intelligence, native talent, etc.)
The quotation from Langlands that you give is not affirming the same thing as Tao is denying (though it's possible that Tao would in fact deny it if asked), in at least two ways.
For the avoidance of doubt, I won't be terribly surprised if it turns out that (say) 75% of world-class mathematicians think top 0.1% IQ is necessary to be a top 0.1% mathematician, but I'm not sure you've made much of a case yet. I'd be a little more surprised if it were 75% of world-class mathematicians who have put as much thought into the question as Tao has; I've no idea how much Langlands has actually thought about the question, but a throwaway aside in an essay about something else isn't necessarily the product of deep thought.
I'll briefly state my own opinions on all this, not that there's any reason why anyone should care. I'll use the term "R" for the particular thing that, e.g., Raven's matrices test. I think it's obvious that, all else being equal, more of any cognitive strength is better for mathematical success; more R is always going to be an advantage. So, of course, are other cognitive strengths, and other attributes such as love of mathematics, capacity for hard work, etc. They all doubtless correlate somewhat with R. So the question is something like: at a given overall level of rarity-of-useful-attributes, what things does mathematical success depend on most strongly? If you're looking at a population for which some particular attribute is extreme, then prima facie you would expect that attribute to drop off in importance by this criterion as it gets more extreme, because of how rarity increases. I think it's uncontroversial that to achieve any kind of mathematical success it's almost essential to have a pretty good R. Beyond a certain level -- let's say roughly corresponding to a measured IQ of 140 or so -- I think the importance of different cognitive strengths starts to depend a lot on what kind of mathematics you're doing. For instance, I would guess that combinatorialists tend to have higher IQ than differential geometers. That doesn't mean that combinatorialists are smarter than differential geometers, for two reasons. (1) I think equating R with smartness becomes less sensible at the highest levels; extreme R is a specialized talent and in terms of (e.g.) practical success, giving an immediate impression of great brainpower, etc., I strongly suspect that at a given overall level of rarity you're going to do better with very high R plus very high other things (e.g., Scott-style verbalish reasoning) than with extreme R optimized for acing IQ tests. (2) The particular bundle of talents needed for great success in differential geometry is probably about as rare, and about as clearly constitutive of "smartness" by any reasonable standard, as the particular bundle needed for great success in combinatorics. And beyond a certain level -- whose rarity maybe corresponds to a measured IQ of 150 or so -- at any given level of rarity I bet other attributes that aren't exactly cognitive strengths matter more than all those cognitive strengths do. So if you stratify prospective mathematicians by IQ (which of course is very similar to R), I would expect a picture like this. IQ <= 130: mathematical success very very strongly dependent on R. IQ <= 150: mathematical success correlated with R, but other cognitive strengths matter more in many fields of mathematics. IQ > 150: mathematical success mostly determined by other cognitive and not-so-cognitive strengths.
I conjecture that the above is broadly consistent with your evidence. I think it is also consistent (with a little interpretive licence -- e.g., his idea of what constitutes exceptional intelligence is probably skewed upwards relative to most people's) with what Tao has said. I am willing to be convinced that I'm wrong on either point.
Thanks for the detailed comment.
I don't think that exceptional intelligence is either necessary or sufficient to be an exceptional mathematician. Tao's statement "But an exceptional amount of intelligence has almost no bearing on whether one is an exceptional mathematician." is a very strong statement: if he had said "plays only a moderate role in whether one is an exceptional mathematician" he would have been on much more solid ground.
I agree that the Langlands quote is by itself not strong evidence against Tao's assertion for the reasons that you give, but it's still evidence. I'm relying on many weak arguments. I'll gradually flesh them out in my sequence.
I share your intuition re: combinatorialists vs.geometers. One of my friends spent a lot of time with Chern, who struck him as being quite ordinary with respect to R, while being exceptional on a number of other dimensions. Grothendieck's self-assessment suggests that it is in fact possible to be amongst the greatest mathematicians without exceptional R.
A key point that you might be missing (certainly I did for many years) is that there just aren't many people of exceptional intelligence. Suppose that it were true that IQ is normally distributed: then the number of people of IQ 145+ would be 60x larger than the number of people of IQ 160+. Under this hypothesis, even if only 1 in 20 exceptional mathematicians had IQ 160+, that would mean that people in that range were 3x as likely as their IQ 145+ counterparts. to become exceptional mathematicians. It's been suggested that the distribution of IQ is in fact fat-tailed because of assortative mating, and this blunts the force of the aforementioned argument, but it's also true that more than 5% of exceptional mathematicians have IQ 160+: I think the actual figure is closer to 50%.
It should be noted that if measured IQ is fat-tailed, this is because there is something wrong with IQ tests. IQ is defined to be normally distributed with a mean of 100 and a standard deviation of either 15 or 16 depending on which definition you're using. So if measured IQ is fat-tailed, then the tests aren't calibrated properly(of course, if your test goes all the way up to 160, it is almost inevitably miscalibrated, because there just aren't enough people to calibrate it with).
You don't want to force a normal distribution on the data. You're free to do so if you'd like, e.g. by asking takers millions of questions so as to get very fine levels of granularity, and then mapping people at the 84th percentile of "questions answered correctly" to IQ 115, people at the 98th percentile to IQ 130, etc.
But what you really want is a situation where you have a (log)-linear relationship between standard deviations and other things that IQ correlates with, and if you force the data to obey a normal distribution, you'll lose this.
The rationale for using a normal distribution is the central limit theorem, but that holds only when the summands are uncorrelated: assortative mating can induce correlations between e.g. having gene A that increases IQ and having gene B that increases IQ.
Would you characterize this post as a reasonable description of what you're talking about in your discussion of "R"?
Yes, that's the guts of it.
Is this just a "screw you"?
How about: Terence was telling a polite white lie of the sort he probably often tells. Politeness is an easier guess than "poor metacognition".