DocumentCode :
2884573
Title :
Harnessing Twitter "Big Data" for Automatic Emotion Identification
Author :
Wenbo Wang ; Lu Chen ; Thirunarayan, Krishnaprasad ; Sheth, A.P.
Author_Institution :
Kno.e.sis Center, Wright State Univ., Dayton, OH, USA
fYear :
2012
fDate :
3-5 Sept. 2012
Firstpage :
587
Lastpage :
592
Abstract :
User generated content on Twitter (produced at an enormous rate of 340 million tweets per day) provides a rich source for gleaning people\´s emotions, which is necessary for deeper understanding of people\´s behaviors and actions. Extant studies on emotion identification lack comprehensive coverage of "emotional situations" because they use relatively small training datasets. To overcome this bottleneck, we have automatically created a large emotion-labeled dataset (of about 2.5 million tweets) by harnessing emotion-related hash tags available in the tweets. We have applied two different machine learning algorithms for emotion identification, to study the effectiveness of various feature combinations as well as the effect of the size of the training data on the emotion identification task. Our experiments demonstrate that a combination of unigrams, big rams, sentiment/emotion-bearing words, and parts-of-speech information is most effective for gleaning emotions. The highest accuracy (65.57%) is achieved with a training data containing about 2 million tweets.
Keywords :
emotion recognition; grammars; learning (artificial intelligence); social networking (online); automatic emotion identification task; bigrams; comprehensive coverage; emotion-bearing words; emotion-related hashtags; emotional situations; feature combinations; harnessing Twitter big data; large emotion-labeled dataset; machine learning algorithms; parts-of-speech information; people behaviors; people emotions; small training datasets; training data; unigrams; user generated content; Accuracy; Blogs; Educational institutions; Training; Training data; Twitter; USA Councils; Emotion Analysis; Emotion Identification; Emotion Intelligence; Twitter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Privacy, Security, Risk and Trust (PASSAT), 2012 International Conference on and 2012 International Confernece on Social Computing (SocialCom)
Conference_Location :
Amsterdam
Print_ISBN :
978-1-4673-5638-1
Type :
conf
DOI :
10.1109/SocialCom-PASSAT.2012.119
Filename :
6406313
Link To Document :
بازگشت