timtyler comments on Backchaining causes wishful thinking - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (18)
This all sounds very strange to me. If there is a supervisor - but all they do is use a carrot and a stick - then I think that would generally be classified as reinforcement learning. Supervised learning is where the learner gets given the correct outputs - or is told the right answers.
http://en.wikipedia.org/wiki/Supervised_learning
http://en.wikipedia.org/wiki/Unsupervised_learning
http://en.wikipedia.org/wiki/Semi-supervised_learning
I'm saying that applying carrot/stick is equivalent to saying yes/no.
I deleted the whole paragraph about supervised/unsupervised, since it contributed nothing and was obviously a distraction.