x

LESSWRONG
LW

zoop

zoop

Message

78

2

19

4y

zoop

78

4y

zoop — LessWrong

Reminder: AI Safety is Also a Behavioral Economics Problem

Last week, OpenAI released the official version of o1, alongside a system card explaining their safety testing framework. Astute observers, most notably Zvi, noted something peculiar: o1's safety testing was performed on a model that... wasn't the release version of o1 (or o1 pro). Weird! Unexpected! If you care about...

Dec 20, 2024•2

zoop's Shortform

Nov 20, 2024•2