Whose track record of AI predictions would you like to see evaluated?

Jonny Spicer

2

[ Question ]

Whose track record of AI predictions would you like to see evaluated?

by Jonny Spicer

29th Jan 2025

1 min read

A

1 3

2

As uncertainty grows around how AI development will affect culture and society, it becomes more valuable to compare track records of predictions about technological progress.

I've recently been working on automating parts of the methodology from Arb's Scoring The Big 3's Predictive Performance report^[1], and have had some promising preliminary results. I hope to try to automate most of the steps in the original report, making it feasible to analyse many more track records and publish the results.

I am particularly interested in the following questions:

Which track record(s) would you find valuable to have evaluated in a similar way to Asimov, Clarke and Heinlein’s, as in the Arb report?
What would you want to see from an LLM-based evaluation that would give you confidence that the results are meaningful and accurate?

^{^}
See also original Cold Takes post explaining why such evaluations are valuable

AI TimelinesForecasting & PredictionFuturismAI

Personal Blog

2

New Answer

New Comment

1 Answers sorted by
top scoring

GRI

Feb 25, 2025

20

I would love to see an analysis and overview of predictions from the Dwarkesh podcast with Leopold. One for Situational awareness would be great too.

[-]Jonny Spicer1y41

That's good to know - transcripts from Dwarkesh's podcast are one of the things I'd be most excited about evaluating too and agreed the one with Leopold seems like a great one to start with.

LESSWRONG
LW

LESSWRONG
LW

2

[ Question ]

Whose track record of AI predictions would you like to see evaluated?

2

2

1 Answers sorted by
top scoring

Feb 25, 2025

2

2

[ Question ]

Whose track record of AI predictions would you like to see evaluated?

2

2

1 Answers sorted by top scoring

Feb 25, 2025

2

1 Answers sorted by
top scoring