ThisSpaceAvailable comments on Does random reward evoke stronger habits? - Less Wrong Discussion
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (17)
That's known as a VR 4 schedule (variable-ratio 4) because the behavior is rewarded an average of every four times the correct response is given. Variable schedules maximize what is known as resistance to extinction; the probability a behavior will decrease in frequency goes down. Continuous schedules are best for establishing a new behavior. I would expect they use continuous reinforcement whenever a new skill is being learned in the game.
Upvote for content, but I think that there's a typo in your second sentence
Fixed