My team at Open Philanthropy just launched two requests for proposals:

* Proposals to create benchmarks measuring how well LLM agents (like AutoGPT) perform on difficult real-world tasks, similar to recent work by ARC Evals.[1]
* Proposals to study and/or forecast the near-term real-world capabilities and impacts of LLMs and...
Introduction

How many years will pass before transformative AI is built? Three people who have thought about this question a lot are Ajeya Cotra from Open Philanthropy, Daniel Kokotajlo from OpenAI, and Ege Erdil from Epoch. Despite each spending at least hundreds of hours investigating this question, they still...
Open Phil announced two weeks ago that we’re hiring for over 20 roles across our teams working on global catastrophic risk reduction — and we’ll answer questions at our AMA starting tomorrow. Ahead of that, I wanted to share some information about the roles I’m hiring for on my team...
Kelsey Piper and I just launched a new blog about AI futurism and AI alignment called Planned Obsolescence. If you’re interested, you can check it out here. Both of us have thought a fair bit about what we see as the biggest challenges in technical work and in policy to...
I worked on my draft report on biological anchors for forecasting AI timelines mainly between ~May 2019 (three months after the release of GPT-2) and ~Jul 2020 (a month after the release of GPT-3), and posted it on LessWrong in Sep 2020 after an internal review process. At the time,...
I think that in the coming 15-30 years, the world could plausibly develop “transformative AI”: AI powerful enough to bring us into a new, qualitatively different future, via an explosion in science and technology R&D. This sort of AI could be sufficient to make this the most important century of...
ARC has published a report on Eliciting Latent Knowledge, an open problem that we believe is central to alignment. We think reading this report is the clearest way to understand what problems we are working on, how they fit into our plan for solving alignment in the worst case, and...