DocumentCode :
3576367
Title :
Sentiment detection and visualization of Chinese micro-blog
Author :
Zhitao Wang ; Yu Zhiwen ; Liming Chen ; Bin Guo
Author_Institution :
Sch. of Comput. Sci., Northwestern Polytech. Univ., Xi´an, China
fYear :
2014
Firstpage :
251
Lastpage :
257
Abstract :
Micro-blog has been increasingly used for the public to express their opinions, and for organisations to detect public sentiment about social events. In contrast to the effort and progress made in English-based micro-blog analysis, research on Chinese micro-blog received relatively little attention. In this paper we examine and identify the key problems of this field, focusing particularly on the characteristics of innovative words, emoticon elements and hierarchical structure of Chinese “Weibo”. Based on the analysis we propose and develop associated theoretical and technological methods to address these problems. These include the development of new sentiment word mining method based on three wording standards and point-wise metrics, a rule set model for analyzing sentiment features of different linguistic components, and the corresponding methodology for calculating sentiment on multi-granularity considering emoticon elements. We use original Chinese tweets from a dataset of Sina Weibo to test and evaluate our new word discovery and sentiment detection methods. Initial results show that our new diction can improve sentiment detection, and demonstrate that our multi-level rule set method is more effective by giving 10.2% and 1.5% higher average accuracy than two existing methods for Chinese micro-blog sentiment analysis. In addition, we exploit visualisation techniques to study the relationships between online sentiment and real life, which can help depict the correlation between public emotions and events.
Keywords :
data visualisation; human factors; natural language processing; social networking (online); text analysis; Chinese Sina Weibo; Chinese microblog sentiment analysis; Chinese microblog visualization technique; Chinese tweets; English-based microblog analysis; emoticon elements; hierarchical structure; innovative words; linguistic components; multigranularity; multilevel rule set method; online sentiment; point-wise metrics; public emotions; public events; public opinions; public sentiment detection method; rule set model; sentiment word mining method; social events; word discovery; wording standards; Feature extraction; Mood; Pragmatics; Sentiment analysis; Silicon; Twitter; Vectors; Chinese Micro-blog; Sentiment Detection; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Science and Advanced Analytics (DSAA), 2014 International Conference on
Type :
conf
DOI :
10.1109/DSAA.2014.7058081
Filename :
7058081
Link To Document :
بازگشت