Title :
A Hybrid Method of Feature Selection for Chinese Text Sentiment Classification
Author :
Wang, Suge ; Wei, Yingjie ; Li, Deyu ; Zhang, Wu ; Li, Wei
Author_Institution :
Shanghai Univ., Shanghai
Abstract :
Text sentiment classification can be extensively applied to information retrieval, text filtering, online tracking evaluation, the diagnoses of public opinions and chat systems. In this paper, a kinds of hybrid methods, based on category distinguishing ability of words and information gain, is adopted to feature selection. For examining the impact of varying the feature dimension to classification results, using corpus of car reviews, feature dimensions, 1000, 2000 and 3000 are adopted in our experiments. The experiments classification results indicate that the hybrid methods are best with feature dimension equal to 3000, and the result by using hybrid methods is superior to that by directly using information gain. In our experiments F value can achieve over 80%. Finally, some mistake examples are employed to indicate the limitations of methods in this paper.
Keywords :
classification; feature extraction; natural languages; text analysis; Chinese text sentiment classification; chat system; feature selection; information retrieval; online tracking evaluation; text filtering; Information filtering; Information filters; Information retrieval; Machine learning; Mathematics; Natural language processing; Support vector machine classification; Support vector machines; Tagging; Text categorization;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
DOI :
10.1109/FSKD.2007.49