DocumentCode :
3439264
Title :
Multi-Class Sentiment Analysis with Clustering and Score Representation
Author :
Farhadloo, Mohsen ; Rolland, Eric
Author_Institution :
Sch. of Eng., Ernest & Julio Gallo Manage. Program & EECS Group, Univ. of California, Merced, Merced, CA, USA
fYear :
2013
fDate :
7-10 Dec. 2013
Firstpage :
904
Lastpage :
912
Abstract :
Sentiment analysis, or opinion mining, is the computational study of people's opinions expressed in written language or text. Sentiment analysis brings together various research areas such as natural language processing, data mining and text mining, and is fast becoming of major importance to organizations as they integrate online commerce into their operations. This paper proposes improved methods for aspect-level sentiment analysis. We propose to utilize bag of nouns instead of bag of words to improve the clustering results for aspect identification, and a new feature set, score representation, that leads to more accurate sentiment identification. This scheme is based upon three scores (positiveness, neutralness and negativeness) that are learned from the data for each term. Using this new score representation scheme, we improve the performance of 3-class sentiment analysis on sentences by 20% in terms of F1-measure, as compared to previously published research. We demonstrate the usefulness of the methodology using data from the popular online travel information site TripAdvisor.com.
Keywords :
data mining; information analysis; learning (artificial intelligence); pattern clustering; 3-class sentiment analysis; TripAdvisor.com; aspect identification; aspect-level sentiment analysis; bag-of-nouns; bag-of-words; clustering; computational study; data mining; f1-measure; feature set; learning; multiclass sentiment analysis; natural language processing; negativeness score; neutralness score; online commerce; online travel information site; opinion mining; people opinion; positiveness score; score representation; sentiment analysis; sentiment identification; text mining; Clustering algorithms; Data mining; Feature extraction; Organizations; Support vector machines; Vectors; Vocabulary; Sentiment Analysis; Text Mining; User Reviews;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4799-3143-9
Type :
conf
DOI :
10.1109/ICDMW.2013.63
Filename :
6754018