• DocumentCode
    2463702
  • Title

    A Method of Text Feature Extraction Based on Weighted Scatter Difference

  • Author

    Haifeng, Liu ; Zhan, Su ; Zeqing, Yao ; Xueren, Zhang

  • Author_Institution
    Inst. of Sci., PLA Univ. of Sci. & Technol., Nanjing, China
  • Volume
    3
  • fYear
    2010
  • fDate
    16-17 Dec. 2010
  • Firstpage
    83
  • Lastpage
    86
  • Abstract
    Feature reduction is one of the core technologies of automatic text categorization. As for the scatter difference criterion, poor categorization effect is made when the between-class distance is small and the class density is high. In order to solve this problem, a weighted method based on the sample distribution is shown in the paper, which will make the between-class and within-class scatter matrixes with poor scatter be weighted, to enhance the categorization ability after dimensional reduction and to improve the dimensional reduction effect of linear feature extraction method based on scatter difference. The following experiment tells us that this method is superior to the original maximum scatter difference method in precision rate and recall rate.
  • Keywords
    feature extraction; information retrieval; text analysis; automatic text categorization; core technologies; feature reduction; text feature extraction method; weighted scatter difference; Covariance matrix; Feature extraction; Imaging; Support vector machine classification; Text categorization; Training; Vectors; feature extraction; feature reduction; scatter difference; text classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems (GCIS), 2010 Second WRI Global Congress on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-9247-3
  • Type

    conf

  • DOI
    10.1109/GCIS.2010.49
  • Filename
    5709328