DocumentCode
2492055
Title
Manifolds for training set selection through outlier detection
Author
Tolba, A.S.
Author_Institution
Fac. of Comput. Studies, Arab Open Univ., Safat, Kuwait
fYear
2010
fDate
15-18 Dec. 2010
Firstpage
467
Lastpage
472
Abstract
The effect of the training set on supervised classifier performance has always been overlooked. This paper provides a new approach for training set cleaning based on the concept of outlier detection to help build sound class models during the training of supervised classifiers. Outliers in a training set result in classifier performance deterioration and slow convergence. For training set cleaning, the proposed technique transforms non-linear relationships between high dimensional patterns into a simple geometric relationship. The Isometric pattern Mapping (ISOMAP) is used to embed the high dimensional training set patterns to a low-dimensional manifold. The dispersion of mapped points will be used to locate the outliers and measure their outlyingness. Several experiments on real data sets show the promising performance of the proposed technique.
Keywords
data mining; feature extraction; pattern classification; pattern matching; geometric relationship; high dimensional pattern; isometric pattern mapping; outlier detection; supervised classifier performance; training set selection; Face; Classifier Performance; Isometric Mapping; Manifolds; Outlier Detection; Sensitivity Analysis; Training set cleaning;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Information Technology (ISSPIT), 2010 IEEE International Symposium on
Conference_Location
Luxor
Print_ISBN
978-1-4244-9992-2
Type
conf
DOI
10.1109/ISSPIT.2010.5711754
Filename
5711754
Link To Document