• DocumentCode
    468343
  • Title

    A Hybrid Method of Feature Selection for Chinese Text Sentiment Classification

  • Author

    Wang, Suge ; Wei, Yingjie ; Li, Deyu ; Zhang, Wu ; Li, Wei

  • Author_Institution
    Shanghai Univ., Shanghai
  • Volume
    3
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    435
  • Lastpage
    439
  • Abstract
    Text sentiment classification can be extensively applied to information retrieval, text filtering, online tracking evaluation, the diagnoses of public opinions and chat systems. In this paper, a kinds of hybrid methods, based on category distinguishing ability of words and information gain, is adopted to feature selection. For examining the impact of varying the feature dimension to classification results, using corpus of car reviews, feature dimensions, 1000, 2000 and 3000 are adopted in our experiments. The experiments classification results indicate that the hybrid methods are best with feature dimension equal to 3000, and the result by using hybrid methods is superior to that by directly using information gain. In our experiments F value can achieve over 80%. Finally, some mistake examples are employed to indicate the limitations of methods in this paper.
  • Keywords
    classification; feature extraction; natural languages; text analysis; Chinese text sentiment classification; chat system; feature selection; information retrieval; online tracking evaluation; text filtering; Information filtering; Information filters; Information retrieval; Machine learning; Mathematics; Natural language processing; Support vector machine classification; Support vector machines; Tagging; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type

    conf

  • DOI
    10.1109/FSKD.2007.49
  • Filename
    4406276