Introducing ControlArena: A library for running AI control experiments
Today, we’re launching ControlArena, a library for running AI control experiments. It has been a collaborative effort between UK AISI, Redwood Research, and various other orgs! As AI systems become more powerful and autonomous, ensuring that they remain under human control is one of the central challenges of AI security....