In the spirit of "Self-fulfilling misalignment data might be poisoning our AI models", what are historical examples of self-fulfilling prophecies that have affected AI alignment and development?

I've put a few potential examples below to seed discussion.

Chipmonk

60

https://x.com/sama/status/1621621724507938816 

Chipmonk

40

Situational Awareness and race dynamics? h/t Jan Kulveit @Jan_Kulveit 
