Title :
Detecting Threats of Violence in Online Discussions Using Bigrams of Important Words
Author_Institution :
Dept. of Comput. Sci., Oslo & Akershus Univ. Coll. of Appl. Sci., Oslo, Norway
Abstract :
Making violent threats towards minorities like immigrants or homosexuals is increasingly common on the Internet. We present a method to automatically detect threats of violence using machine learning. A material of 24,840 sentences from YouTube was manually annotated as violent threats or not, and was used to train and test the machine learning model. Detecting threats of violence works quit well with an error of classifying a violent sentence as not violent of about 10% when the error of classifying a non-violent sentence as violent is adjusted to 5%. The best classification performance is achieved by including features that combine specially chosen important words and the distance between those in the sentence.
Keywords :
Internet; learning (artificial intelligence); social networking (online); Internet; YouTube; bigrams; homosexuals; immigrants; machine learning; online discussions; threat detection; violent sentence; violent threats; Educational institutions; Europe; Feature extraction; Internet; Materials; Text mining; YouTube; classification; lasso logistic regression; machine learning; social media; text mining; violent threats;
Conference_Titel :
Intelligence and Security Informatics Conference (JISIC), 2014 IEEE Joint
Conference_Location :
The Hague
Print_ISBN :
978-1-4799-6363-8
DOI :
10.1109/JISIC.2014.64