Claude Sonnet 3.7 (often) knows when it’s in alignment evaluations — LessWrong