Title :
Are SentiWordNet scores suited for multi-domain sentiment classification?
Author :
Denecke, Kerstin
Author_Institution :
L3S Res. Center, Hannover, Germany
Abstract :
Motivated by the numerous applications of analysing opinions in multi-domain scenarios, this paper studies the potential of a still rarely considered approach to the problem of multi-domain sentiment analysis based on Senti-WordNet as lexical resource. SentiWordNet scores are exploited together with additional features to assign a polarity to a text using machine learning. On the other hand, a rule-based approach is studied based on sentiment scores. The introduced methods are tested on single domains of a real-world data set consisting of documents in six different domains, but also in cross-domain settings. The results show that for cross-domain sentiment analysis rule-based approaches with fix opinion lexica are unsuited. For machine-learning based sentiment classification a mixture of documents of different domains achieves good results.
Keywords :
learning (artificial intelligence); natural language processing; pattern classification; text analysis; SentiWordNet score; machine learning; multidomain sentiment analysis; multidomain sentiment classification; opinion analysis; opinion lexica; rule-based approach; sentiment score; text polarity; Composite materials; Discussion forums; Feeds; Frequency; Functional analysis; Internet; Machine learning; Materials testing; Mutual information; Statistics;
Conference_Titel :
Digital Information Management, 2009. ICDIM 2009. Fourth International Conference on
Conference_Location :
Ann Arbor, MI
Print_ISBN :
978-1-4244-4253-9
Electronic_ISBN :
978-1-4244-4254-6
DOI :
10.1109/ICDIM.2009.5356764