You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

Lumifer comments on The "best" mathematically-informed topics? - Less Wrong Discussion

13 Post author: Capla 14 November 2014 03:39AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (113)

You are viewing a single comment's thread. Show more comments above.

Comment author: othercriteria 14 November 2014 02:44:22PM *  6 points [-]

do what gwern does

Or do the complete opposite.

The impression I get of gwern is that he reads widely, thinks creatively, and experiments frequently, so he is constantly confronted with hypotheses that he has encountered or has generated. His use of statistics is generally confirmatory, in that he's using data to filter out unjustified hypotheses so he can further research or explore or theorize about the remaining ones.

Another thing you can do with data is exploratory data analysis, using statistics to pull out interesting patterns for further consideration. The workflow for this might look more like:

  • Acquire (often multivariate) data from another researcher, source, or experiment.
  • Look at its marginal distributions to check your understanding of the system and catch really obvious outliers.
  • Maybe use tools like mixture modeling or Box-Cox transformation to clarify marginal distributions.
  • Use statistical tools like (linear, logistic, support vector, etc.) regression, PCA, etc., to find patterns in the data.
  • Do stuff with the resulting patterns: think up mechanisms, do confirmatory analysis, check literature, show them to other people, etc.

A lot of what you get out of this process will be spurious, but seeing hypotheses that the data seemed to support go down in flames is a good way to convince yourself of the value of confirmatory analysis, and of tools for dealing with this multiple testing problem.

I remember Gelman saying useful stuff like this, but it's been a while since I read that post so I might be mischaracterizing it.

(Ilya, you know all of this, surely at a deeper level than I do. I'm just rhetorically talking to you as a means to dialogue at Capla. Gwern, hopefully my model of you is not too terrible.)

Comment author: Lumifer 14 November 2014 03:45:27PM 2 points [-]

using statistics to pull out interesting patterns for further consideration

That's usually called "data mining" and is a popular activity. Unfortunately many people think that's all they need and stop before the confirmatory phase.