gwern comments on A toy model of the control problem - Less Wrong Discussion
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (24)
feep and I have implemented a slightly tweaked version of this using a DQN agent in Reinforce.js. (Tabular turns out to be a bit infeasible.) At the moment, if you want to modify settings as in Karpathy's demos, you'll have to do something like download it locally to edit, with a command like
wget --mirror 'www.gwern.net/docs/rl/armstrong-controlproblem/index.html' && firefox ./www.gwern.net/docs/rl/armstrong-controlproblem/index.html
Thanks, most excellent!