FrameBenignly comments on Does random reward evoke stronger habits? - Less Wrong

1 Post author: Bound_up 17 August 2015 09:03PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (17)

You are viewing a single comment's thread.

Comment author: FrameBenignly 18 August 2015 04:08:16AM *  3 points [-]

That's known as a VR 4 schedule (variable-ratio 4) because the behavior is rewarded an average of every four times the correct response is given. Variable schedules maximize what is known as resistance to extinction; the probability a behavior will decrease in frequency goes down. Continuous schedules are best for establishing a new behavior. I would expect they use continuous reinforcement whenever a new skill is being learned in the game.

Comment author: ThisSpaceAvailable 18 August 2015 10:16:22PM 0 points [-]

Upvote for content, but I think that there's a typo in your second sentence

Variable schedules maximize what is known as resistance to extinction, the probability a behavior will decrease in frequency goes down. Perhaps a semicolon instead of a comma, or "as frequency of rewards ... " instead of "in frequency ...", was intended?

Comment author: FrameBenignly 19 August 2015 05:20:49AM 0 points [-]

Fixed