You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

selylindi comments on Stupid Questions February 2015 - Less Wrong Discussion

9 Post author: Gondolinian 02 February 2015 12:36AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (198)

You are viewing a single comment's thread.

Comment author: selylindi 03 February 2015 05:16:26PM *  0 points [-]

Background: Statistics. Something about the Welch–Satterthwaite equation is so counterintuitive that I must have a mental block, but the equation comes up often in my work, and it drives me batty. For example, the degrees of freedom decrease as the sample size increases beyond a certain point. All the online documentation I can find for it gives the same information as Wikipedia, in which k = 1/n. I looked up the original derivation and, in it, the k are scaling factors of a linear combination of random variables. So at some point in the literature after the original derivation, it was decided that k = 1/n was superior in some regard; I lack the commitment needed to search the literature to find out why.

The stupid questions:

1) Does anyone know why the statistics field settled on k = 1/n?

2) Can someone give a relatively concrete mental image or other intuitive suggestion as to why the W-S equation really ought to behave in the odd ways it does?

Comment author: Lumifer 03 February 2015 06:01:54PM *  1 point [-]

Does anyone know why the statistics field settled on k = 1/n?

I am guessing that this is the default assumption of equal weighting or equal scaling of the variances that you are pooling. If you want to assign non-equal weights you should have some specific reason to do so.

I don't think it's "superior", it's just the simplest default in the absence of any additional information.