• DocumentCode
    3369819
  • Title

    Improved GA-based text clustering algorithm

  • Author

    Shi, Kansheng ; Li, Lemin ; He, Jie ; Zhang, Naitong ; Liu, Haitao ; Song, Wentao

  • Author_Institution
    Shanghai Jiaotong Univ., Shanghai, China
  • fYear
    2011
  • fDate
    28-30 Oct. 2011
  • Firstpage
    675
  • Lastpage
    679
  • Abstract
    The traditional K-means algorithm is sensitive to the initial points and easy to fall into local optimum. To avoid this kind of flaw, an improved GA-based text clustering algorithm CGHCM is proposed. The new algorithm is proven effective to avoid falling into local optimum and obtains better clustering results.
  • Keywords
    genetic algorithms; pattern clustering; text analysis; unsupervised learning; GA-based text clustering algorithm; K-means algorithm; genetic algorithm; unsupervised machine learning method; Accuracy; Algorithm design and analysis; Biological cells; Clustering algorithms; Genetic algorithms; Mathematical model; Vectors; GA; K-means; Similarity measurement; VSM; text clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Broadband Network and Multimedia Technology (IC-BNMT), 2011 4th IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-61284-158-8
  • Type

    conf

  • DOI
    10.1109/ICBNMT.2011.6156021
  • Filename
    6156021