Title :
A genetic based wrapper feature selection approach using Nearest Neighbour Distance Matrix
Author :
Sainin, Mohd Shamrie ; Alfred, Rayner
Author_Institution :
Dept. of Comput. Sci., Univ. Utara Malaysia, Sintok, Malaysia
Abstract :
Feature selection for data mining optimization receives quite a high demand especially on high-dimensional feature vectors of a data. Feature selection is a method used to select the best feature (or combination of features) for the data in order to achieve similar or better classification rate. Currently, there are three types of feature selection methods: filter, wrapper and embedded. This paper describes a genetic based wrapper approach that optimizes feature selection process embedded in a classification technique called a supervised Nearest Neighbour Distance Matrix (NNDM). This method is implemented and tested on several datasets obtained from the UCI Machine Learning Repository and other datasets. The results demonstrate a significant impact on the predictive accuracy for feature selection combined with the supervised NNDM in classifying new instances. Therefore it can be used in other applications that require feature dimension reduction such as image and bioinformatics classifications.
Keywords :
data mining; genetic algorithms; learning (artificial intelligence); NNDM; UCI machine learning repository; data mining optimization; genetic based wrapper feature selection approach; nearest neighbour distance matrix; Accuracy; Classification algorithms; Data mining; Genetic algorithms; Genetics; Optimization; Training; classification; data mining; data mining optimization; distance matrix; feature selection; genetic algorithm; machine learning; nearest neighbour;
Conference_Titel :
Data Mining and Optimization (DMO), 2011 3rd Conference on
Conference_Location :
Putrajaya
Print_ISBN :
978-1-61284-211-0
Electronic_ISBN :
2155-6938
DOI :
10.1109/DMO.2011.5976534