DocumentCode :
2492055
Title :
Manifolds for training set selection through outlier detection
Author :
Tolba, A.S.
Author_Institution :
Fac. of Comput. Studies, Arab Open Univ., Safat, Kuwait
fYear :
2010
fDate :
15-18 Dec. 2010
Firstpage :
467
Lastpage :
472
Abstract :
The effect of the training set on supervised classifier performance has always been overlooked. This paper provides a new approach for training set cleaning based on the concept of outlier detection to help build sound class models during the training of supervised classifiers. Outliers in a training set result in classifier performance deterioration and slow convergence. For training set cleaning, the proposed technique transforms non-linear relationships between high dimensional patterns into a simple geometric relationship. The Isometric pattern Mapping (ISOMAP) is used to embed the high dimensional training set patterns to a low-dimensional manifold. The dispersion of mapped points will be used to locate the outliers and measure their outlyingness. Several experiments on real data sets show the promising performance of the proposed technique.
Keywords :
data mining; feature extraction; pattern classification; pattern matching; geometric relationship; high dimensional pattern; isometric pattern mapping; outlier detection; supervised classifier performance; training set selection; Face; Classifier Performance; Isometric Mapping; Manifolds; Outlier Detection; Sensitivity Analysis; Training set cleaning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology (ISSPIT), 2010 IEEE International Symposium on
Conference_Location :
Luxor
Print_ISBN :
978-1-4244-9992-2
Type :
conf
DOI :
10.1109/ISSPIT.2010.5711754
Filename :
5711754
Link To Document :
بازگشت