DocumentCode :
1447081
Title :
Weakly Supervised Joint Sentiment-Topic Detection from Text
Author :
Lin, Chenghua ; He, Yulan ; Everson, Richard ; Rüger, Stefan
Author_Institution :
Dept. of Comput. Sci., Univ. of Exeter, Exeter, UK
Volume :
24
Issue :
6
fYear :
2012
fDate :
6/1/2012 12:00:00 AM
Firstpage :
1134
Lastpage :
1145
Abstract :
Sentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes a novel probabilistic modeling framework called joint sentiment-topic (JST) model based on latent Dirichlet allocation (LDA), which detects sentiment and topic simultaneously from text. A reparameterized version of the JST model called Reverse-JST, obtained by reversing the sequence of sentiment and topic generation in the modeling process, is also studied. Although JST is equivalent to Reverse-JST without a hierarchical prior, extensive experiments show that when sentiment priors are added, JST performs consistently better than Reverse-JST. Besides, unlike supervised approaches to sentiment classification which often fail to produce satisfactory performance when shifting to other domains, the weakly supervised nature of JST makes it highly portable to other domains. This is verified by the experimental results on data sets from five different domains where the JST model even outperforms existing semi-supervised approaches in some of the data sets despite using no labeled documents. Moreover, the topics and topic sentiment detected by JST are indeed coherent and informative. We hypothesize that the JST model can readily meet the demand of large-scale sentiment analysis from the web in an open-ended fashion.
Keywords :
data mining; text analysis; latent Dirichlet allocation; opinion mining; probabilistic modeling framework; reverse-JST; sentiment analysis; subjective information detection; weakly supervised joint sentiment-topic detection; Analytical models; Biological system modeling; Data mining; Joints; Media; Motion pictures; Resource management; Sentiment analysis; joint sentiment-topic (JST) model.; latent Dirichlet allocation (LDA); opinion mining;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2011.48
Filename :
5710933
Link To Document :
بازگشت