Title :
Genetic programming for simultaneous feature selection and classifier design
Author :
Muni, Durga Prasad ; Pal, Nikhil R. ; Das, Jyotirmoy
Author_Institution :
Electron. & Commun. Sci. Unit, Indian Stat. Inst., Calcutta, India
Abstract :
This paper presents an online feature selection algorithm using genetic programming (GP). The proposed GP methodology simultaneously selects a good subset of features and constructs a classifier using the selected features. For a c-class problem, it provides a classifier having c trees. In this context, we introduce two new crossover operations to suit the feature selection process. As a byproduct, our algorithm produces a feature ranking scheme. We tested our method on several data sets having dimensions varying from 4 to 7129. We compared the performance of our method with results available in the literature and found that the proposed method produces consistently good results. To demonstrate the robustness of the scheme, we studied its effectiveness on data sets with known (synthetically added) redundant/bad features.
Keywords :
feature extraction; genetic algorithms; learning (artificial intelligence); pattern classification; c-class problem; evolutionary algorithm; feature classifier; feature ranking scheme; genetic programming; online feature selection algorithm; Algorithm design and analysis; Classification tree analysis; Error analysis; Evolutionary computation; Filters; Genetic programming; Neural networks; Robustness; Support vector machines; Testing; Classification; classifier design; evolutionary algorithm; feature selection; genetic programming; Algorithms; Artificial Intelligence; Computer Simulation; Decision Support Techniques; Information Storage and Retrieval; Models, Genetic; Pattern Recognition, Automated;
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMCB.2005.854499