• DocumentCode
    2688305
  • Title

    A K-Nearest Neighbor Algorithm based on cluster in text classification

  • Author

    Wang, Chun-Yan ; Yan, Yu-Guang ; Zhang, Kuo ; Li, Jian-Gang

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Changchun Normal Coll., Changchun, China
  • Volume
    1
  • fYear
    2010
  • fDate
    24-26 Aug. 2010
  • Firstpage
    225
  • Lastpage
    228
  • Abstract
    The K-Nearest Neighbor Algorithm (K-NN) is an important approach for automatic text classification. In this paper, cluster was applied In order to overcome the disadvantages of the traditional K-NN algorithm. First Clustering was utilized in training set through an improved K-mean approach to select the most representative samples as cluster center. Then we compute the comparability between the testing samples and the central vector of each cluster. A K-NN algorithm based on cluster was presented. The experiment results verify that this classification algorithm is much faster than the traditional K-NN algorithm, and it can raise the accuracy.
  • Keywords
    pattern classification; pattern clustering; text analysis; automatic text classification; cluster center; k-means approach; k-nearest neighbor algorithm; training set; Artificial neural networks; Biological system modeling; cluster; k-Nearest Neighbor; text classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer, Mechatronics, Control and Electronic Engineering (CMCE), 2010 International Conference on
  • Conference_Location
    Changchun
  • Print_ISBN
    978-1-4244-7957-3
  • Type

    conf

  • DOI
    10.1109/CMCE.2010.5610477
  • Filename
    5610477