The Power of Reinforcement — LessWrong