How To Win The AI Box Experiment (Sometimes) — LessWrong