• DocumentCode
    2835482
  • Title

    An improved KNN text categorization on skew sort condition

  • Author

    Haifeng, Liu ; Shousheng, Liu ; Zhan, Su

  • Author_Institution
    Inst. of Sci., PLA Univ. of Sci. & Technol., Nanjing, China
  • Volume
    7
  • fYear
    2010
  • fDate
    22-24 Oct. 2010
  • Abstract
    KNN is one of most frequent used methods for text categorization. The feature high-dimension and skew of sort distribution will impact the performance of the classifier. An improved KNN based on skew sort condition is introduced in this paper for solving the problem that the big swatch sort with more texts is easy to be selected when conducting the K neighbor selection. Firstly, text feature selection is conducted by an improved information gain method for more efficient using the categorization distribution information in the sample training set. Then an improved KNN classifier based on the sort is used for categorization, which can solve the problem that big swatch sort is selected in training set. The experiment shows this method has improved the KNN classification performance.
  • Keywords
    pattern classification; text analysis; KNN classifier; KNN text categorization; big swatch sort; skew sort condition; Information entropy; Variable speed drives; KNN; feature reduction; feature selection; text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Application and System Modeling (ICCASM), 2010 International Conference on
  • Conference_Location
    Taiyuan
  • Print_ISBN
    978-1-4244-7235-2
  • Electronic_ISBN
    978-1-4244-7237-6
  • Type

    conf

  • DOI
    10.1109/ICCASM.2010.5620491
  • Filename
    5620491