Changed the first instance of "RL" to "Reinforcement Learning (RL)": if I didn't immediately recognize the abbreviation, someone learning this for the first time won't either.