• DocumentCode
    1899075
  • Title

    A Feature Selection Method Based on Information Gain and Genetic Algorithm

  • Author

    Lei, Shang

  • Author_Institution
    Dept. of Inf. Sci. & Technol., Shandong Univ. of Political Sci. & Law, Jinan, China
  • Volume
    2
  • fYear
    2012
  • fDate
    23-25 March 2012
  • Firstpage
    355
  • Lastpage
    358
  • Abstract
    With the rapid development of the Computer Science and Technology, It has become a major problem for the users that how to quickly find useful or needed information. Text categorization can help people to solve this question. The feature selection method has become one of the most critical techniques in the field of the text automatic categorization. A new method of the text feature selection based on Information Gain and Genetic Algorithm is proposed in this paper. This method chooses the feature based on information gain with the frequency of items. Meanwhile, for the information filtering systems, this method has been improved fitness function to fully consider the characteristics of weight, text and vector similarity dimension, etc. The experiment has proved that the method can reduce the dimension of text vector and improve the precision of text classification.
  • Keywords
    genetic algorithms; information filtering; pattern classification; text analysis; computer science; computer technology; feature selection method; genetic algorithm; information filtering system; information gain; text categorization; text classification; text similarity dimension; vector similarity dimension; weight similarity dimension; Accuracy; Classification algorithms; Genetic algorithms; Genetics; Information filtering; Text categorization; Feature Selection; Genetic Algorithm; Information Gain; information filtering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Electronics Engineering (ICCSEE), 2012 International Conference on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-4673-0689-8
  • Type

    conf

  • DOI
    10.1109/ICCSEE.2012.97
  • Filename
    6188038