Austin Witte

Message

Austin Witte

It Can't Be Mesa-Optimizers All The Way Down (Or Else It Can't Be Long-Term Supercoherence?)

Epistemic status: After a couple hours of arguing with myself, this still feels potentially important, but my thoughts are pretty raw here. Hello LessWrong! I’m an undergraduate student studying at the University of Wisconsin-Madison, and part of the new Wisconsin AI Safety Initiative. This will be my first “idea” post...

Mar 31, 202320

A Brief Overview of AI Safety/Alignment Orgs, Fields, Researchers, and Resources for ML Researchers

Crossposted to EA Forum [Link] TLDR: I’ve written an overview of the AI safety space, tagged by keywords and subject/field references (short version, long version). The aim is to allow existing ML researchers to quickly gauge interest in the subject based on their existing subfield skills and interests! Overview When...

Feb 2, 202318

LESSWRONG
LW

LESSWRONG
LW

Austin Witte

Austin Witte

Austin Witte

It Can't Be Mesa-Optimizers All The Way Down (Or Else It Can't Be Long-Term Supercoherence?)

A Brief Overview of AI Safety/Alignment Orgs, Fields, Researchers, and Resources for ML Researchers

Austin Witte

Austin Witte

Austin Witte

It Can't Be Mesa-Optimizers All The Way Down (Or Else It Can't Be Long-Term Supercoherence?)

A Brief Overview of AI Safety/Alignment Orgs, Fields, Researchers, and Resources for ML Researchers

Overview