You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

Punoxysm comments on Open Thread, May 19 - 25, 2014 - Less Wrong Discussion

2 Post author: somnicule 19 May 2014 04:49AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (289)

You are viewing a single comment's thread. Show more comments above.

Comment author: Punoxysm 20 May 2014 03:38:26AM 0 points [-]

As a former Natural Language Processing researcher, the technology definitely exists. Using general vocabulary combined with many (semi-manually generated) regexes to figure out argumentative or weaselly sentences with decent accuracy should be doable. It could improve over time if you input exemplar sentences you came across.

Comment author: sixes_and_sevens 20 May 2014 11:38:10AM *  2 points [-]

Do you have a recommendation for a good language-agnostic text / reference resource on NLP?

ETA: my own background is a professional programmer with a reasonable (undergrad) background in statistics. I've dabbled with machine learning (I'm in the process of developing this as a skill set) and messed around with python's nltk. I'd like a broader conceptual overview of NLP.

Comment author: Punoxysm 20 May 2014 08:13:48PM 2 points [-]

I'd recommend this book for a general overview : http://nlp.stanford.edu/fsnlp/

However, tasks like parsing are unnecessary for many tasks. A simple classifier on a sparse vector of word counts can be quite effective as a starting point in classifying sentence/document content.