DocumentCode :
3660524
Title :
Renovating word vectors to build Chinese sentiment lexicon
Author :
Xinyu Guan;Qinke Peng;Jing Zhang;Xiaokang Zhang
Author_Institution :
Systems Engineering Institute, School of Electronic and Information Engineering, Xi´an Jiaotong University, 710049, China
fYear :
2015
Firstpage :
2977
Lastpage :
2982
Abstract :
Sentiment lexicon is the core of many academic and commercial sentiment analysis system. But compared with the plentiful English sentiment resources, Chinese sentiment lexicon is scarce which limits the application of lexicon-based method in Chinese sentiment analysis. In this work, we proposed a novel architecture to produce Chinese sentiment lexicon. At first, we trained word vectors from distributional information of words in large corpora. The capacity of these word vectors in sentiment analysis is confined because of the noisy information in their feature space. So a feature selection method was used to renovate the word vectors. After that, the renovated word vectors was used with a similarity-based method to produce the Chinese sentiment lexicon. To evaluate the usefulness of our lexicon, both qualitative and quantitative experiments were designed. The results show that it outperforms previously studied lexicons and indicate that our sentiment lexicon could be used as an important resource for sentiment classification tasks.
Keywords :
"Sentiment analysis","Semantics","Classification algorithms","Training","Logistics","Tag clouds","Computer architecture"
Publisher :
ieee
Conference_Titel :
Information and Automation, 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/ICInfA.2015.7279798
Filename :
7279798
Link To Document :
بازگشت