1. The NYT article Your A.I. Radiologist Will Not Be With You Soon reports, “Leaders at OpenAI, Anthropic and other companies in Silicon Valley now predict that A.I. will eclipse humans in most cognitive tasks within a few years… The predicted extinction of radiologists provides a telling case study. So...
Epistemic status: I think you should interpret this as roughly something like “GenAI is not so powerful that it shows up in the most obvious way of analyzing the data, but maybe if someone did a more careful analysis which controlled for e.g. macroeconomic trends they would find that GenAI...
Kathleen Finlinson & Ben West, May 2025 Mission * We are working to establish the possibility of cooperation between humans and agentic AIs. We are starting with a simple version of making deals with current AIs, and plan to iterate based on what we learn and as AI technology advances....
METR is developing evaluations for AI R&D capabilities, such that evaluators can determine if further AI development risks a “capabilities explosion”, which could be extraordinarily destabilizing if realized. METR is hiring ML research engineers/scientists to drive these AI R&D evaluations forward. Why focus on risks posed by AI R&D capabilities?...
In March of this year, 30,000 people, including leading AI figures like Yoshua Bengio and Stuart Russell, signed a letter calling on AI labs to pause the training of AI systems. While it seems unlikely that this letter will succeed in pausing the development of AI, it did draw substantial...
Meta: this is a small interpretability project I was interested in. I'm sharing in case it's useful to other people, but I expect it will not be of wide interest. Summary 1. Locating and Editing Factual Associations in GPT created a method to modify the weights of a language model...