DocumentCode :
33336
Title :
Creating a Fine-Grained Corpus for Chinese Sentiment Analysis
Author :
Yanyan Zhao ; Bing Qin ; Ting Liu
Author_Institution :
Harbin Inst. of Technol., Harbin, China
Volume :
30
Issue :
1
fYear :
2015
fDate :
Jan.-Feb. 2015
Firstpage :
36
Lastpage :
43
Abstract :
Writing comments on products or news has become a popular activity in social media. The amount of opinionated text available online has been growing rapidly, increasing the need for techniques that can analyze opinions expressed in such text so that reviews can be easily absorbed by users. To date, most techniques depend on annotated corpora. However, existing corpora are almost sentence-level works that ignore important global sentiment information in other sentences. Given the rise of advanced applications, more fine-grained corpora are needed, even at the sentence level. The authors aim to create a fine-grained corpus for Chinese sentiment analysis, and more importantly, explore new sentiment analysis tasks by analyzing the annotated corpus. The proposed fine-grained annotation scheme not only introduces cross-sentence and global sentiment information (such as "target entity"\´) but also includes new sentence-level elements (such as "implicit aspect"). Based on this scheme, this corpus can provide a more fine-grained platform for researchers to study algorithms for advanced applications. In addition, an in-depth analysis on the annotated corpus is made and several important but ignored tasks, such as the target-aspect pair extraction task, are explored, which can give useful hints about future directions.
Keywords :
information retrieval; natural language processing; social networking (online); text analysis; Chinese sentiment analysis; annotated corpora; comment writing; cross-sentence information; fine-grained corpus creation; global sentiment information; implicit aspect; opinion analysis; sentence-level elements; social media; target entity; target-aspect pair extraction task; Mobile handsets; Motion pictures; Performance evaluation; Product design; Sentiment analysis; Social network services; Target recognition; Writing; XML; Chinese; corpus for sentiment analysis; fine-grained; intelligent systems; social media;
fLanguage :
English
Journal_Title :
Intelligent Systems, IEEE
Publisher :
ieee
ISSN :
1541-1672
Type :
jour
DOI :
10.1109/MIS.2014.33
Filename :
6824679
Link To Document :
بازگشت