wnoise comments on Polyhacking - Less Wrong

75 Post author: Alicorn 28 August 2011 08:35AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (603)

You are viewing a single comment's thread. Show more comments above.

Comment author: wnoise 28 August 2011 06:57:38PM *  4 points [-]

Something's wrong with those numbers. Medians of integer-valued quantities are always integers or half-integers.

EDIT: I've taken a look at the report, and it doesn't say anything about how they calculate medians, so I don't know how they're fudging their numbers to get these out.

EDIT 2: I should also say "good job for looking at the research and getting numbers", even if I'd like these researchers to be more transparent as to what they're actually reporting.

Comment author: satt 29 August 2011 04:09:33AM 1 point [-]

An uninformed guess: those medians are presumably based on survey data, so they might've been adjusted using the survey's sampling weights.

Comment author: wnoise 29 August 2011 05:50:32AM *  3 points [-]

It's almost certainly true, perhaps doing a weighted average of the medians of subgroups. However, any method that does that is not producing a median. A good way of doing that adjustment might give "cooked" numbers for the various options, but the point where 50% are below and 50% are above would still almost certainly be an integer. And if it is actually balanced (highly unlikely with so many data points), so that any number greater than X and less than X+1 divides the population in two, then the convention is to report X + 1/2. There is no information about the median that anything past the decimal point can actually convey.