You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

Stefan_Schubert comments on Could auto-generated troll scores reduce Twitter and Facebook harassments? - Less Wrong Discussion

0 Post author: Stefan_Schubert 30 April 2015 02:05PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (42)

You are viewing a single comment's thread. Show more comments above.

Comment author: Stefan_Schubert 01 May 2015 11:12:01AM *  0 points [-]

My suggestion was not to train the system on user ratings:

The first is to let a number of sensible people give their troll scores of different Facebook posts and tweets (using the general and vague definition of what is to count as trolling). You would feed this into your algorithms, which would learn which combinations of words are characteristic of trolls (as judged by these people), and which arent't. The second is to simply list a number of words or phrases which would count as characteristic of trolls, in the sense of the general and vague definition.

Comment author: V_V 01 May 2015 11:21:06PM 3 points [-]

So, essentially it would depend on the company opinion.

Anyway, lists of words or short phrases won't work. Keep in mind that trolls are human intelligences, any AI short of Turing-test level won't beat human intelligences at their own game.