"bag-of-words": That is a feature representation, and it has little to do with the learning method. You could encode a text as bag-of-words and train an SVM on those features.
Yes, sure, but the most generic way is just to look at a histogram distance between word occurrences. I guess that would generically fall under k-means or similar methods, but that's what I was referring to by citing bag-of-words as a method on its own. Of course, you can mix, match, and cascade all of these to produce different methods.
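To make the "histogram distance between word occurrences" idea concrete, here is a rough sketch, not anyone's canonical method. It assumes whitespace tokenization, Euclidean distance, and a nearest-example rule for picking a label; the function names and the toy examples are made up for illustration.

```python
from collections import Counter
import math


def bag_of_words(text):
    """Map a text to a word-count histogram, ignoring word order.

    Assumption: lowercasing plus whitespace splitting is enough tokenization.
    """
    return Counter(text.lower().split())


def histogram_distance(a, b):
    """Euclidean distance between two word-count histograms.

    Assumption: Euclidean distance; other histogram distances would work too.
    A Counter returns 0 for a missing word, so the vocabularies need not match.
    """
    vocab = set(a) | set(b)
    return math.sqrt(sum((a[w] - b[w]) ** 2 for w in vocab))


def nearest_label(query, labeled_texts):
    """Return the label of the example whose histogram is closest to the query."""
    q = bag_of_words(query)
    closest = min(
        labeled_texts,
        key=lambda pair: histogram_distance(q, bag_of_words(pair[0])),
    )
    return closest[1]


# Hypothetical toy data, purely for illustration.
examples = [
    ("the cat sat on the mat", "animals"),
    ("stocks fell as markets closed", "finance"),
]

print(nearest_label("the cat chased the mouse", examples))
```

The same `bag_of_words` histograms could just as well be turned into fixed-length vectors and handed to an SVM or any other learner, which is the point about representation and learning method being separate choices.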
http://cacm.acm.org/magazines/2011/6/108655-qa-a-lifelong-learner/fulltext
Wow, this is quite interesting. What are your thoughts?