User Comment Replies

Do you feel that's still an issue when you comapre to the human expert baseline?

Human experts scored 79% on ProtocolQA, and o1-preview scored 81%
Human experts scored 38% on BioLP-bench, and o1-preview scored 36%

The fact that both human experts and o1-preview scored twice as high on ProtocolQA than BioLP-bench doesn't feel that inconsistent to me. It seems your BioLP-bench questions are just "twice as hard". I'd find it more inconsistent if o1-preview matched human expert performance on one test but not the other.

(There are other open questions about whether the human experts in both studies are comparable and how much time people had)

8Igor Ivanov5mo

And I'm unsure that experts are comparable, to be frank. Due to financial limitations, I used graduate students in BioLP, while the authors of LAB-bench used PhD-level scientists.

3Igor Ivanov5mo

I didn't have in mind o1, these exact results seem consistent. Here's an example I had in mind: Claude 3.5 Sonnet (old) scores 48% on ProtocolQA, and 7.1% on BioLP-bench GPT-4o scores 53% on ProtocolQA and 17% on BioLP-bench

Dangerous capability tests should be harder

LucaRighetti5mo10

Then that seems bad, but also that AI is not counterfactual -- so adding safeguards to models is probably not the way to get the risk down.

Is cybercrime really costing trillions per year?

Answer by LucaRighettiOct 03, 2024*323

I had looked into this for a previous research project. For what it's worth, I don't think there are any perfect sources, but my own BOTECs led me to believe the number people are usually after is ~~$10B-$100B~~ ~$30B-$300B:

FBI IC3 (2023): Headline figures that it receives $5-10B/yr of reported losses across the globe. If you assume this mostly only covers US victims (say 4X because the US is 25% of world GDP) and some go unreported (say 2X by dollar value), then you get something like $50B-$100B/yr globally
[New via JamieRV's comment] The “2007 GAO repor

... (read more)

4Fabien Roger6mo

This is great, thank you very much!

LESSWRONG
LW

All of LucaRighetti's Comments + Replies