Agenda Reflection: Testing Automated Alignment
Sharing a mostly inactive empirical agenda from mid 2025 (companion piece to a Techgov paper). Currently not planning to spend much time on it, but may consider collaborating. For a while I was curious about Scalable Oversight and AI Control. Both because they're cool to work on, and because they...