cousin_it comments on How not to move the goalposts - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (71)
If you successfully design your C++ hiring criteria to be colorblind — to not notice the color of weasels, but only to notice how good they are at C++ — then performance on the hiring criteria will shadow weasel color as an indicator of C++ ability.
You might end up with 99 blue weasels hired for every one red weasel hired; but you will have successfully filtered out all the red weasels that are bad at C++, just as you successfully filtered out all the blue weasels that are bad at C++. After all, only a tiny fraction of blue weasels meet your C++ hiring criteria, too.
So at that point, you should actually trust your hiring criteria and compensate weasels with no regard for their color.
(It's true that red weasels are likely to, at one or two times in their career, need a couple of months off for frenzied weasel dancing. But it's also true that blue weasels are more likely to get trodden on by a cow and need medical leave, because they're less careful when walking through pastures on the way to work.)
My comment kinda assumed that hiring criteria meeting your strict standard of colorblindness are unexpectedly hard to design. Let's say all red weasels and most blue ones suck at C++, but some blue weasels completely rule. Also, once a month every weasel (blues and reds equally) unpredictably goes into a code frenzy for 182 minutes and temporarily becomes exactly as good as a blue one that rules. Your standardized test will mostly admit blue weasels that rule, but sometimes you'll get a random-colored weasel that sucks. If you're colorblind, you have no hope of weeding out the random suckers. But if you're color-aware, you can weed out half of them. Of course it also works if a tiny minority of red weasels can code instead of none.
The problems begin when the ability-distribution of red/blue weasels change, and the hiring-committee is still using restrictions based on the old distribution. eg red weasel ability has been steadily increasing, but the old hiring criteria still says "don't hire red weasels as they have no technical ability to speak of!"
but yes I agree - it's all difficult because it's hard to create a test that is as accurate as actual real-life working with a person. That's why the popularity of those awful "three month probation periods".