• DocumentCode
    3326902
  • Title

    A new feature selection algorithm in text categorization

  • Author

    Zhao, Wei ; Wang, Yafei ; Li, Dan

  • Author_Institution
    Coll. of Inf. Technol., Jilin Agric. Univ., Changchun, China
  • Volume
    1
  • fYear
    2010
  • fDate
    5-7 May 2010
  • Firstpage
    146
  • Lastpage
    149
  • Abstract
    A major problem with text classification problems is the high dimensionality of the feature space. This paper investigates how genetic algorithm and k-means algorithm can help select relevant features in text classification. which uses the genetic algorithm (GA) optimization features to implement global searching, and uses k-means algorithm to selection operation to control the scope of the search, ensure the validity of each gene and the speed of convergence. Our experimental results show that the combination of GA and k-means algorithm is quite useful in reduce the high feature dimension, and improved accuracy and efficiency for text classification.
  • Keywords
    genetic algorithms; pattern clustering; text analysis; feature selection algorithm; genetic algorithm; global searching; k-means algorithm; text categorization; text classification problems; Automatic control; Automation; Communication system control; Computer science; Educational institutions; Genetic algorithms; Information technology; Mathematical model; Space technology; Text categorization; feature selection; genetic algorithm; k-means algorithm; text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communication Control and Automation (3CA), 2010 International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-5565-2
  • Type

    conf

  • DOI
    10.1109/3CA.2010.5533870
  • Filename
    5533870