Title :
A GA-based feature selection and ensemble learning for high-dimensional datasets
Author :
Xia, Pei-yong ; Ding, Xiang-Qian ; Jiang, Bai-ning
Author_Institution :
Dept. of Comput. Sci., Ocean Univ. of China, Qingdao, China
Abstract :
When dealing with high-dimensional datasets with fewer samples, feature selection and ensemble learning are two effective strategies. In this paper, we focus our attention on genetic algorithm based feature selection for ensemble learning. We use an improved GA algorithm (IGA) to reduce the dimensionality of the feature space, and then evaluate using bagging and Ada-Boost constructed by the reduced features. Experimental results on several UCI datasets demonstrate that the improved GA-based feature selection algorithm (IGAFS) is often able to obtain a better feature subset when compared with the standard GA-based feature selection algorithm (SGAFS). Our experiments also indicate that ensemble learning using IGAFS is more accuracy than employing SGAFS and the whole feature space in general conditions.
Keywords :
genetic algorithms; learning (artificial intelligence); pattern classification; Ada-Boost; GA-based feature selection; bagging; ensemble learning; genetic algorithm based feature selection; high-dimensional datasets; Bagging; Boosting; Computational complexity; Computer science; Cybernetics; Filters; Genetic algorithms; Machine learning; Oceans; Pattern recognition; Ada-Boost; Bagging; Ensemble learning; Feature Selection; Genetic algorithms;
Conference_Titel :
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location :
Baoding
Print_ISBN :
978-1-4244-3702-3
Electronic_ISBN :
978-1-4244-3703-0
DOI :
10.1109/ICMLC.2009.5212542