Sarokrae comments on Reinforcement and Short-Term Rewards as Anti-Akratic - Less Wrong Discussion
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (27)
Some of my recent forays into reinforcement learning have been very helpful. I should point out that my life is made a whole lot easier by having a very co-operative OH who is willing to reward me or withhold reward as appropriate, so I've not needed to resort to building a robot!
Things that have been successful:
But yeah, having a person help me do it means I avoid any sort of precommitment failure, and generally makes things much easier!
(Side note: Curly brackets clearly denote euphemisms, but I didn't want to be too crude.)