Title :
Feature subset selection via multi-objective genetic algorithm
Author :
Lac, Hao C. ; Stacey, Deborah A.
Author_Institution :
Comput. & Inf. Sci., Guelph Univ., Ont., Canada
fDate :
31 July-4 Aug. 2005
Abstract :
Real-world datasets tend to be complex, large in size, and may contain many irrelevant features. Eliminating such irrelevant features can significantly improve the performance of a data mining algorithm. In this paper, we propose a multi-objective genetic algorithm that finds a set of Pareto-optimal feature subsets that works as a wrapper around a standard back-propagation algorithm. We also introduce a novel mechanism called the least-crowded selection algorithm that maximizes the diversity of the solutions returned by the algorithm. We justify the proposed method by theoretically and empirically comparing it to the backpropagation neural network and the simple genetic algorithm for feature selection.
Keywords :
Pareto optimisation; backpropagation; data mining; genetic algorithms; Pareto-optimal feature subset; backpropagation neural network; data mining; feature selection; feature subset selection; least-crowded selection; multiobjective genetic algorithm; Backpropagation algorithms; Computer vision; Data mining; Error analysis; Filters; Genetic algorithms; Information science; Minimization methods; Neural networks; Noise reduction;
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
DOI :
10.1109/IJCNN.2005.1556070