In January, I spent over 100 hours reading 50+ AI safety 'plans', interviewing 10+ AI safety researchers, and thinking about AI safety strategy, drawing on my 2.5 years in the field. This list of routes to AI safety success (updated June 2025) is a key output...
This is a summary of Yann LeCun's talk "Mathematical Obstacles on the Way to Human-Level AI". I've tried to make it more accessible to people who are familiar with basic AI concepts, but not the level of maths Yann presents. You can watch the original talk on YouTube. I disagree...
This is a (slightly chaotic and scrappy) list of gaps in the AI safety literature that I think would be useful or interesting to fill. I’ve broken it down into sections: * AI safety problems beyond alignment: Better explaining non-misalignment catastrophic AI risks. * Case studies of analogous problems: Historical lessons from nuclear...
tldr: Government policymakers want to read research, but lack journal access. Your research needs to be open access if you want policymakers to read it, and you should prefer citing open access resources to improve epistemic legibility. Policymakers don’t have access Many seem to assume that government policymakers would have...
AI risk discussions often focus on malfunctions, misuse, and misalignment. But this often misses other key challenges from advanced AI systems: 1. Coordination: Race dynamics may encourage unsafe AI deployment, even from ‘safe’ actors. 2. Power: First-movers with advanced AI could gain permanent military, economic, and/or political dominance. 3. Economics:...
The EU and UK's Network and Information Systems (NIS) Regulations aim to improve the cybersecurity of essential services and important digital providers. They set requirements around network security, information system security, physical security, incident handling, business continuity and security auditing. I think OpenAI needs to comply with these regulations. This...
This article explains concrete AI governance practices people are exploring as of August 2024. Prior summaries have mapped out high-level areas of work, but rarely dive into concrete practice details. For example, they might describe roles in policy development, advocacy and implementation - but don’t specify what practices these policies...