DocumentCode
468343
Title
A Hybrid Method of Feature Selection for Chinese Text Sentiment Classification
Author
Wang, Suge ; Wei, Yingjie ; Li, Deyu ; Zhang, Wu ; Li, Wei
Author_Institution
Shanghai Univ., Shanghai
Volume
3
fYear
2007
fDate
24-27 Aug. 2007
Firstpage
435
Lastpage
439
Abstract
Text sentiment classification can be extensively applied to information retrieval, text filtering, online tracking evaluation, the diagnoses of public opinions and chat systems. In this paper, a kinds of hybrid methods, based on category distinguishing ability of words and information gain, is adopted to feature selection. For examining the impact of varying the feature dimension to classification results, using corpus of car reviews, feature dimensions, 1000, 2000 and 3000 are adopted in our experiments. The experiments classification results indicate that the hybrid methods are best with feature dimension equal to 3000, and the result by using hybrid methods is superior to that by directly using information gain. In our experiments F value can achieve over 80%. Finally, some mistake examples are employed to indicate the limitations of methods in this paper.
Keywords
classification; feature extraction; natural languages; text analysis; Chinese text sentiment classification; chat system; feature selection; information retrieval; online tracking evaluation; text filtering; Information filtering; Information filters; Information retrieval; Machine learning; Mathematics; Natural language processing; Support vector machine classification; Support vector machines; Tagging; Text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location
Haikou
Print_ISBN
978-0-7695-2874-8
Type
conf
DOI
10.1109/FSKD.2007.49
Filename
4406276
Link To Document