gwern comments on [LINK] SMBC on Confirmation Bias - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (29)
Right. In the shower, I also realized that this is just comparing averages- if, say, 10% of the population really hates spoilers, but the other 90% enjoys them enough to make the average for spoiling higher, it's still sensible to put spoiler warnings as a courtesy to the 10%, because the comparison is "cost to warn vs. benefit of warning" not "spoil for everyone or spoil for no one."
I don't think that's the issue; if you look at the graphs, the standard deviation is tiny compared to the variability between stories, and in some the spoilers/no-spoilers don't even overlap. The stats:
The graphs show standard error, not standard deviation. Standard error is standard deviation divided by the square root of the sample size. It's included on graphs to show which differences are statistically significant - it does not give a sense of the variability within a group.
Cohen's d counts standard deviations (d=.18 means that the two means are .18 standard deviations apart), so there is actually a lot of overlap between the groups.
I agree that the small standard deviation suggests that either that doesn't happen or the people in question are much less prevalent than 10% of the population (a number I picked because I have ten fingers). I also suspect that the mechanism roystgnr identified is stronger than the mechanism I identified.
This study isn't set up to differentiate between people, which is what we would need to make a warning policy.
(I had an erroneous statement about the sample size here, which I've deleted.)
Small n? They used 819 subjects - that's bigger than pretty much any psychology cited on LW!
Hmm. That looks like a memory error on my part, as rereading it I don't see what I thought the n was (I remembered ~40). I think I saw 30 subjects, failed to multiply by 24, and it got fuzzed with the passing of time.
Thanks for the correction!