gwern comments on Welcome to Less Wrong! (5th thread, March 2013) - Less Wrong

27 Post author: orthonormal 01 April 2013 04:19PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (1750)

You are viewing a single comment's thread. Show more comments above.

Comment author: HumanitiesResearcher 18 April 2013 05:12:18AM 1 point [-]

I have just such a thing, referred to as "Marks." I haven't yet included that in the code, because I wanted to explore the viability of the method first. So to retreat to the earlier question, why does my proposal strike you as a GIGO situation?

Comment author: gwern 18 April 2013 04:26:36PM 1 point [-]

So to retreat to the earlier question, why does my proposal strike you as a GIGO situation?

You claimed to not know what printers there were, how many there were, and what connection they had to 'Marks'. In such a situation, what on earth do you think you can infer at all? You have to start somewhere: 'we have good reason to believe there were not more than 20 printers, and we think the London printer usually messed up the last page. Now, from this we can start constructing these phylogenetic trees indicating the most likely printers for our sample of books...' There is no view from nowhere, you cannot pick yourself up by your bootstraps, all observation is theory-laden, etc.

Comment author: HumanitiesResearcher 21 April 2013 04:27:25PM 0 points [-]

This all sounds good to me. In fact, I believe that researchers in the humanities are especially (perhaps overly) sensitive to the reciprocal relationship between theory and observation.

I may have overstated the ignorance of the current situation. The scholarly community has already made some claims connecting the Big Book to Print Shops [x,y,z]. The problem is that those claims are either made on non-quantitative bases (eg, "This mark seems characteristic of this Print Shop's status.") or on a very naive frequentist basis (eg, "This mark comes up N times, and that's a big number, so it must be from Print Shop X"). My project would take these existing claims as priors. Is that valid?

Comment author: gwern 21 April 2013 05:14:54PM 0 points [-]

I have no idea. If you want answers like that, you should probably go talk to a statistician at sufficient length to convey the domain-specific knowledge involved or learn statistics yourself.