lukstafi comments on Algorithms as Case Studies in Rationality - Less Wrong

27 Post author: abramdemski 14 February 2011 06:27PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (39)

You are viewing a single comment's thread. Show more comments above.

Comment author: lukstafi 15 February 2011 10:03:43AM *  0 points [-]

Only that "Exploration vs exploitation trade-off" is not an algorithm. Reinforcement learning (RL) is pretty much "non-algorithmic" (as Pei Wang would say). ETA: there are specific algorithms in RL (and in -- related -- planning and game playing), but the "trade-off" is a concept; it sure needs to be expressed algorithmically but is it fair to give credit to "algorithmicality" in this case?

Comment author: abramdemski 15 February 2011 02:55:47PM 0 points [-]

Right; when I say "I'm not systematic about it" I mean that I don't purposefully follow a specific algorithm. I would probably benefit from being a bit more systematic, but for the moment, I'm merely trying to "train my intuition".

I would hope that all these algorithms would be applied "non-algorithmically" in Pei Wang's sense-- that is, the ideas from the algorithm should interact dynamically with the rest of my thought process.

Comment author: wedrifid 15 February 2011 10:13:25AM 0 points [-]

Reinforcement learning is pretty much "non-algorithmic"

I'm rather certain I could implement reinforcement learning as an algorithm. In fact, I'm rather certain I have done so already. If I can point to an algorithm and say "look, that's a damn reinforcement learning algorithm" then I'm not sure how meaningful it can be to call it "non-algorithmic".

Comment author: lukstafi 15 February 2011 03:04:49PM *  0 points [-]

I concede, RL is a prototype example of algorithmic learning problem. The exploration vs exploitation trade-off is something that needs to be addressed by RL algorithms. It is fair then to say that we gain insight into the "trade-off" by recognizing how the algorithms "solve" it.

Comment author: wedrifid 15 February 2011 03:18:55PM 0 points [-]

It is also fair to say there is an abstract concept of 'trade off' that is not itself algorithmic.