Deceptive Alignment

Applied to Language Models Model Us by eggsyntax ago
Applied to Selfish AI Inevitable by Davey Morse ago