2
29Illusory Safety: Redteaming DeepSeek R1 and the Strongest Fine-Tunable Models of OpenAI, Anthropic, and GoogleChengCheng,
Brendan Murphy,
Adrià Garriga-alonso,
Yashvardhan Sharma,
dsbowen,
smallsilo,
Yawen Duan,
ChrisCundy,
Hannah Betts,
AdamGleave,
Kellin Pelrine