Title :
A Feature Selection Method Based on Information Gain and Genetic Algorithm
Author_Institution :
Dept. of Inf. Sci. & Technol., Shandong Univ. of Political Sci. & Law, Jinan, China
Abstract :
With the rapid development of the Computer Science and Technology, It has become a major problem for the users that how to quickly find useful or needed information. Text categorization can help people to solve this question. The feature selection method has become one of the most critical techniques in the field of the text automatic categorization. A new method of the text feature selection based on Information Gain and Genetic Algorithm is proposed in this paper. This method chooses the feature based on information gain with the frequency of items. Meanwhile, for the information filtering systems, this method has been improved fitness function to fully consider the characteristics of weight, text and vector similarity dimension, etc. The experiment has proved that the method can reduce the dimension of text vector and improve the precision of text classification.
Keywords :
genetic algorithms; information filtering; pattern classification; text analysis; computer science; computer technology; feature selection method; genetic algorithm; information filtering system; information gain; text categorization; text classification; text similarity dimension; vector similarity dimension; weight similarity dimension; Accuracy; Classification algorithms; Genetic algorithms; Genetics; Information filtering; Text categorization; Feature Selection; Genetic Algorithm; Information Gain; information filtering;
Conference_Titel :
Computer Science and Electronics Engineering (ICCSEE), 2012 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-0689-8
DOI :
10.1109/ICCSEE.2012.97