timtyler comments on Intelligence Explosion analysis draft: types of digital intelligence - Less Wrong

2 Post author: lukeprog 14 November 2011 11:07PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (24)

You are viewing a single comment's thread. Show more comments above.

Comment author: timtyler 18 November 2011 12:25:48AM *  0 points [-]

You can substitute "utility" for "reward", if you prefer. Reinforcement learning is a fairly general framework, except for its insistence on a scalar reward signal. If you talk to RL folk about the need for multiple reward signals, they say that sticking that information in the sensory channels is mathematically equivalent - which is kinda true.