This post examines AI alignment through the lens of systems thinking and safety engineering. We aim to identify structural mechanisms that can maintain alignment in complex sociotechnical systems: systems in which AIs interact with multiple human operators and stakeholders.
One conception of AI misalignment is as a control problem: the behavior of an AI system diverges from its safety constraints and governing principles. Unlike simple human-AI pairs, deployed AI systems operate in hierarchical sociotechnical environments involving multiple operators and stakeholders with competing objectives. Understanding and preventing misalignment therefore requires analysis at the system level; scrutiny of individual components is insufficient.
The framework presented here draws on established principles from safety engineering and systems analysis:
- Work