Title :
SentiCorr: Multilingual Sentiment Analysis of Personal Correspondence
Author :
Tromp, Erik ; Pechenizkiy, Mykola
Author_Institution :
Dept. of Comput. Sci., Eindhoven Univ. of Technol., Eindhoven, Netherlands
Abstract :
We present the system for automated sentiment analysis on multilingual user generated content from various social media and e-mails. One of the main goals of the system is to make people aware how much positive and negative content they read and write. The output is summarized into a database allowing for basic OLAP style exploration of the data across basic dimensions including for example time and correspondents dimensions. The sentiment analysis is based on a four-step approach including language identification for short texts, part-of-speech tagging, subjectivity detection and polarity detection techniques. We extensively tested our system on data from Twitter, Face book and Hyves. We also developed an MS Outlook sentiment analysis plug-in allowing people to see how positive or negative the content of the e-mails is and provide confirmatory or correcting feedback on the correctness of the sentiment classification at the sentence or e-mail level.
Keywords :
data mining; information analysis; pattern classification; social networking (online); Facebook; Hyves; MS Outlook sentiment analysis; OLAP style data exploration; SentiCorr system; Twitter; e-mail; multilingual sentiment analysis; multilingual user generated content; online analytical processing; part-of-speech tagging technique; personal correspondence analysis; polarity detection technique; sentiment classification; short text language identification technique; social media; subjectivity detection technique; Electronic mail; Facebook; Media; Stress; Stress measurement; Tagging; Twitter; demo; multilingual; personal correspondence; sentiment classification;
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-0005-6
DOI :
10.1109/ICDMW.2011.152