Title :
Manifolds for training set selection through outlier detection
Author_Institution :
Fac. of Comput. Studies, Arab Open Univ., Safat, Kuwait
Abstract :
The effect of the training set on supervised classifier performance has always been overlooked. This paper provides a new approach for training set cleaning based on the concept of outlier detection to help build sound class models during the training of supervised classifiers. Outliers in a training set result in classifier performance deterioration and slow convergence. For training set cleaning, the proposed technique transforms non-linear relationships between high dimensional patterns into a simple geometric relationship. The Isometric pattern Mapping (ISOMAP) is used to embed the high dimensional training set patterns to a low-dimensional manifold. The dispersion of mapped points will be used to locate the outliers and measure their outlyingness. Several experiments on real data sets show the promising performance of the proposed technique.
Keywords :
data mining; feature extraction; pattern classification; pattern matching; geometric relationship; high dimensional pattern; isometric pattern mapping; outlier detection; supervised classifier performance; training set selection; Face; Classifier Performance; Isometric Mapping; Manifolds; Outlier Detection; Sensitivity Analysis; Training set cleaning;
Conference_Titel :
Signal Processing and Information Technology (ISSPIT), 2010 IEEE International Symposium on
Conference_Location :
Luxor
Print_ISBN :
978-1-4244-9992-2
DOI :
10.1109/ISSPIT.2010.5711754