gwern comments on Participation in the LW Community Associated with Less Bias - Less Wrong Discussion
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (49)
Out of curiosity, I looked at what a more appropriate logistic regression would say (using this guide); given the categorical variable of the question answer, can one predict how many survey entries were missing/omitted (as a proxy for time investment)? The numbers and method are a little different from a t-test, and the result is a little less statistically significant, but as before there's no real relationship*:
* a note to other analyzers: it's really important to remove null answers/NAs because they'll show relationships all over the place. In this example, if you leave NAs in for the
CFARQuestion1field, you'll wind up getting a very statistically significant relationship - because everyCFARQuestionleft NA by definition increasesMissingAnswersby 1! And people who didn't answer that question probably didn't answer a lot of other questions, so the NA respondents enable a very easy reliable prediction ofMissingAnswers...