Reminder: AI Safety is Also a Behavioral Economics Problem
Last week, OpenAI released the official version of o1, alongside a system card explaining their safety testing framework. Astute observers, most notably Zvi, noted something peculiar: o1's safety testing was performed on a model that... wasn't the release version of o1 (or o1 pro). Weird! Unexpected! If you care about...
Dec 20, 2024