tondwalkar comments on Harry Potter and the Methods of Rationality discussion thread, part 22, chapter 93 - Less Wrong

5 [deleted] 06 July 2013 03:02AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (354)

You are viewing a single comment's thread. Show more comments above.

Comment author: tondwalkar 07 July 2013 02:15:40AM 0 points [-]

but all rewards are subject to Goodhart's Law. I expect to see people doing a lot of ill-thought-out somethings because the reward structure is too simplified.

Well, since the reward structure isn't explicit, and we expect McGonagoll to get much smarter on a much smaller timescale than opportunities to earn a reward by "disobeying McGonagoll according to your own judgement."