DocumentCode :
2053673
Title :
Use of neighborhood and stratification approaches to speed up instance selection algorithm
Author :
Ros, Frédéric ; Harba, Rachid ; Piclin, Nadege ; Pintore, Marco
Author_Institution :
Inst. Prisme, Orleans Univ., Orleans, France
fYear :
2010
fDate :
7-10 Dec. 2010
Firstpage :
320
Lastpage :
325
Abstract :
This paper investigates a method for instance selection in the context of supervised classification adapted to large databases. Based on the scale up concept, the method reduces the time required to perform the selection procedure by enabling the application of known condensation instance techniques to only small data sets instead of the whole set. The novelty of our approach relies in the way of hybridizing neighborhood and stratification approaches. The key idea is to consider instances found out for a given strata to generate sub populations for the other strata representing critical regions of the feature space. Experiments performed with various data sets revealed the effectiveness and applicability of the proposed approach.
Keywords :
data mining; learning (artificial intelligence); pattern classification; set theory; condensation instance techniques; feature space; instance selection algorithm; neighborhood approaches; stratification approaches; supervised classification; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Databases; Nearest neighbor searches; Prototypes; Training; clustering algorithm; instance selection; k-nearest neighbors; supervised classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Soft Computing and Pattern Recognition (SoCPaR), 2010 International Conference of
Conference_Location :
Paris
Print_ISBN :
978-1-4244-7897-2
Type :
conf
DOI :
10.1109/SOCPAR.2010.5686648
Filename :
5686648
Link To Document :
بازگشت