Invulnerable Incomplete Preferences: A Formal Statement
Produced as part of the SERI ML Alignment Theory Scholars Program - Summer 2023 Cohort.
My thanks to Eric Chen, Elliott Thornley, and John Wentworth for invaluable discussion and comments on earlier drafts. All errors are mine.
This article presents a few theorems about the invulnerability of agents with incomplete...