DocumentCode :
2634990
Title :
Building a General Purpose Cross-Domain Sentiment Mining Model
Author :
Whitehead, Matthew ; Yaeger, Larry
Author_Institution :
Sch. of Inf., Indiana Univ., Bloomington, IN, USA
Volume :
4
fYear :
2009
fDate :
March 31 2009-April 2 2009
Firstpage :
472
Lastpage :
476
Abstract :
Building a model using machine learning that can classify the sentiment of natural language text often requires an extensive set of labeled training data from the same domain as the target text. Gathering and labeling new datasets whenever a model is needed for a new domain is time-consuming and difficult, especially if a dataset with numeric ratings is not available. In this paper we consider the problem of building models that have a high sentiment classification accuracy without the aid of a labeled dataset from the target domain. We show that ensembles of existing domain models can be used to achieve a classification accuracy that approaches that of models trained on data from the target domain.
Keywords :
data mining; learning (artificial intelligence); natural language processing; pattern classification; dataset labeling; general purpose cross-domain sentiment mining model; high sentiment classification accuracy; machine learning; natural language text; Digital cameras; Drugs; Informatics; Machine learning; Motion pictures; Natural languages; Portable computers; Support vector machines; Testing; Training data; Data Mining; Machine Learning; Opinion Mining; Sentiment Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
Type :
conf
DOI :
10.1109/CSIE.2009.754
Filename :
5171041
Link To Document :
بازگشت