All of adamk's Comments + Replies

adamk10

I don't think we considered very closely whether regression slopes are preferable to correlations for deciding a benchmark's susceptibility to safetywashing. We focused mainly on correlations in the paper for the sake of having a standardized metric across benchmarks.

Figure 4 seems to be the only place where we mention the slope of the regression line. I'm not speaking for the other authors here, but I think I agree that the implicit argument for saying that "High-correlation + low-slope benchmarks are not necessarily liable for safetywashing" h...
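To make the correlation-versus-slope distinction concrete, here is a minimal sketch with made-up numbers (the data and the helper name `pearson_and_slope` are hypothetical and not taken from the paper). It shows that a benchmark can correlate almost perfectly with capabilities while its regression slope stays small, since the slope equals the correlation multiplied by the ratio of the two standard deviations.

```python
from math import sqrt

def pearson_and_slope(xs, ys):
    """Return (Pearson correlation, OLS slope of ys regressed on xs)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sqrt(sxx * syy), sxy / sxx

# Hypothetical data: one capabilities score and one safety-benchmark
# score per model. The safety scores track capabilities closely but
# move only a few points across the whole capabilities range.
capabilities = [0.1, 0.3, 0.5, 0.7, 0.9]
safety = [0.500, 0.511, 0.519, 0.531, 0.540]

r, slope = pearson_and_slope(capabilities, safety)
print(f"correlation = {r:.3f}, slope = {slope:.3f}")
```

With this toy data the correlation is close to 1 while the slope is small, which is the "high-correlation + low-slope" case discussed above.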

adamk10

I'll begin by saying more about our approach for measuring "capabilities scores" for a set of models (given their scores on a set of standard capabilities benchmarks). We'll assume that all benchmark scores have been normalized. We need some way of converting each model's benchmark scores into one capabilities score per model. Averaging is a weighted combination of the benchmarks in which all weights are equal. Our method is similarly a weighted combination of the benchmarks, but where the weights are higher for benchmark...
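As a rough illustration of the difference between equal weights and data-dependent weights, the sketch below assumes the weights are the loadings of the first principal component (the reply below mentions PCA, but the exact procedure is not spelled out here, so treat this as an assumption). The score table and function names are hypothetical, and power iteration stands in for a library PCA routine.

```python
from math import sqrt

def first_pc_weights(rows, iters=200):
    """First principal component loadings via power iteration.

    rows: one list of normalized benchmark scores per model.
    Returns (weights, centered_rows).
    """
    n, k = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(k)]
    centered = [[r[j] - means[j] for j in range(k)] for r in rows]
    # Sample covariance matrix of the benchmarks (k x k).
    cov = [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(k)]
           for a in range(k)]
    v = [1.0 / sqrt(k)] * k  # start from equal weights
    for _ in range(iters):
        w = [sum(cov[a][b] * v[b] for b in range(k)) for a in range(k)]
        norm = sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v, centered

def weighted_scores(centered, weights):
    """One capabilities score per model: weighted sum of centered scores."""
    return [sum(c * w for c, w in zip(row, weights)) for row in centered]

# Hypothetical table: 4 models x 3 benchmarks. The first two benchmarks
# move together; the third barely varies across models.
table = [
    [0.2, 0.3, 0.50],
    [0.4, 0.5, 0.52],
    [0.6, 0.7, 0.49],
    [0.8, 0.9, 0.51],
]

weights, centered = first_pc_weights(table)
equal = [1.0 / len(table[0])] * len(table[0])
pc_scores = weighted_scores(centered, weights)
avg_scores = weighted_scores(centered, equal)
print("PC weights:", [round(w, 3) for w in weights])
```

In this toy table the third benchmark receives almost no weight under the principal-component weighting, whereas plain averaging weights it the same as the other two.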

2shash42
Thanks, the rationale for using PCA was quite interesting. I also quite like the idea of separating different model classes for this evaluation.
adamk20

Thank you! I'd be glad to include this and any other corrections in an edit once contest results are released. Are there any other errors which catch your eye?

adamk20

I would honestly be interested to see a detailed writeup with good examples of this "maybe amazing" vs "probably good" distinction.

A subtlety here is that the traits that make a candidate a potential outlier are often very different from the traits that would make them “pretty good,” so improving your filtering process to produce more “pretty good” candidates won’t necessarily increase the rate of finding outliers, and might even decrease it.

The most important point I'd still want to grok is what this "might even decrease it" looks like. What are industry exam...

7gwern
https://www.lesswrong.com/posts/dC7mP5nSwvpL65Qu5/why-the-tails-come-apart is relevant.