AI Safety 101 - Chapter 5.2 - Unrestricted Adversarial Training — LessWrong