TheOtherDave comments on The Power of Reinforcement - Less Wrong

96 Post author: lukeprog 21 June 2012 01:42PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (467)

You are viewing a single comment's thread. Show more comments above.

Comment author: TheOtherDave 21 June 2012 07:04:33PM 1 point [-]

It can. Basically the failure modes are the same as when reinforcing others. In particular, it's common to fail to maintain consistent thresholds of self-reward.