You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

TheOtherDave comments on Group rationality diary, 5/28/12 - Less Wrong Discussion

3 Post author: cata 29 May 2012 04:10AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (21)

You are viewing a single comment's thread. Show more comments above.

Comment author: TheOtherDave 30 May 2012 06:02:15PM 3 points [-]

Second the endorsement, but it's important to understand that the relevant average is the particular animal's current baseline (not, for example, the average behavior of similar individuals). That is, you look at the behavior you're getting, and you reward the top N% of behavior along whatever dimension you want to reinforce. Over time the cluster of behaviors will shift in that direction. Keep rewarding the top N% and it will keep shifting in that direction until other factors make that impossible.