lukeprog comments on Intelligence Explosion analysis draft: types of digital intelligence - Less Wrong

2 Post author: lukeprog 14 November 2011 11:07PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (24)

You are viewing a single comment's thread. Show more comments above.

Comment author: lukeprog 15 November 2011 01:28:09PM 1 point [-]

I don't endorse Legg's formalization because it is limited to reinforcement learning agents.

Comment author: lessdazed 15 November 2011 05:45:48PM 2 points [-]

That's a good reason, and you should make that explicit.

Comment author: lukeprog 15 November 2011 09:05:22PM 0 points [-]

Good point.

Comment author: timtyler 18 November 2011 12:25:48AM *  0 points [-]

You can substitute "utility" for "reward", if you prefer. Reinforcement learning is a fairly general framework, except for its insistence on a scalar reward signal. If you talk to RL folk about the need for multiple reward signals, they say that sticking that information in the sensory channels is mathematically equivalent - which is kinda true.