Title :
Fast feature selection with genetic algorithms: a filter approach
Author :
Lanzi, Pier Luca
Author_Institution :
Dipartimento di Elettronica e Inf., Politecnico di Milano, Italy
Abstract :
The goal of the feature selection process is, given a dataset described by n attributes (features), to find the minimum number m of relevant attributes which describe the data as well as the original set of attributes do. Genetic algorithms have been used to implement feature selection algorithms. Previous algorithms presented in the literature used the predictive accuracy of a specific learning algorithm as the fitness function to maximize over the space of possible feature subsets. Such an approach to feature selection requires a large amount of CPU time to reach a good solution on large datasets. This paper presents a genetic algorithm for feature selection which improves previous results presented in the literature for genetic-based feature selection. It is independent of a specific learning algorithm and requires less CPU time to reach a relevant subset of features. Reported experiments show that the proposed algorithm is at least ten times faster than a standard genetic algorithm for feature selection without a loss of predictive accuracy when a learning algorithm is applied to reduced data
Keywords :
data analysis; data description; database theory; deductive databases; genetic algorithms; learning (artificial intelligence); CPU time; data analysis; data description; dataset; feature selection; filter approach; fitness function; genetic algorithms; large datasets; learning algorithm; predictive accuracy; reduced data; relevant attributes; Accuracy; Biological system modeling; Data analysis; Data mining; Filters; Genetic algorithms; History; Predictive models; Risk management; Space exploration;
Conference_Titel :
Evolutionary Computation, 1997., IEEE International Conference on
Conference_Location :
Indianapolis, IN
Print_ISBN :
0-7803-3949-5
DOI :
10.1109/ICEC.1997.592369