DocumentCode :
3778709
Title :
Sentiment classification of short texts based on semantic clustering
Author :
Yunchao He;Chin-Sheng Yang;Liang-Chih Yu;K. Robert Lai;Weiyi Liu
Author_Institution :
Department of Computer Science & Engineering, Yuan Ze University, Taiwan
fYear :
2015
Firstpage :
54
Lastpage :
57
Abstract :
Due to the popularity and ubiquity of social networks, sentiment analysis has become an important and well-covered research area. Short texts usually encounter sparsity problems in representations for their limited texts length. We address this issue by clustering short texts to form a long text. Specifically, we first use k-means clustering algorithms to form k clusters, where each cluster contains texts having close semantic similarity with the same sentiment polarity. Then, these clusters are used to train classifier. In evaluation, the unlabeled text is merged to the most similar positive and negative clusters respectively, and its sentiment polarity is determined by the change of the two clusters´ probabilistic estimates. Experiments on the Twitter dataset show that our approach can achieve a significantly better performance than the bag-of-words method.
Keywords :
"Semantics","Sentiment analysis","Training data","Training","Supervised learning","Encyclopedias"
Publisher :
ieee
Conference_Titel :
Orange Technologies (ICOT), 2015 International Conference on
Type :
conf
DOI :
10.1109/ICOT.2015.7498505
Filename :
7498505
Link To Document :
بازگشت