Title :
Use of neighborhood and stratification approaches to speed up instance selection algorithm
Author :
Ros, Frédéric ; Harba, Rachid ; Piclin, Nadege ; Pintore, Marco
Author_Institution :
Inst. Prisme, Orleans Univ., Orleans, France
Abstract :
This paper investigates a method for instance selection in the context of supervised classification adapted to large databases. Based on the scale up concept, the method reduces the time required to perform the selection procedure by enabling the application of known condensation instance techniques to only small data sets instead of the whole set. The novelty of our approach relies in the way of hybridizing neighborhood and stratification approaches. The key idea is to consider instances found out for a given strata to generate sub populations for the other strata representing critical regions of the feature space. Experiments performed with various data sets revealed the effectiveness and applicability of the proposed approach.
Keywords :
data mining; learning (artificial intelligence); pattern classification; set theory; condensation instance techniques; feature space; instance selection algorithm; neighborhood approaches; stratification approaches; supervised classification; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Databases; Nearest neighbor searches; Prototypes; Training; clustering algorithm; instance selection; k-nearest neighbors; supervised classification;
Conference_Titel :
Soft Computing and Pattern Recognition (SoCPaR), 2010 International Conference of
Conference_Location :
Paris
Print_ISBN :
978-1-4244-7897-2
DOI :
10.1109/SOCPAR.2010.5686648