Title :
On using prototype reduction schemes and classifier fusion strategies to optimize kernel-based nonlinear subspace methods
Author :
Kim, Sang-Woon ; Oommen, B. John
Author_Institution :
Dept. of Comput. Sci. & Eng., Myongji Univ., Yongin, South Korea
fDate :
3/1/2005 12:00:00 AM
Abstract :
In kernel-based nonlinear subspace (KNS) methods, the length of the projections onto the principal component directions in the feature space, is computed using a kernel matrix, K, whose dimension is equivalent to the number of sample data points. Clearly this is problematic, especially, for large data sets. In this paper, we solve this problem by subdividing the data into smaller subsets, and utilizing a prototype reduction scheme (PRS) as a preprocessing module, to yield more refined representative prototypes. Thereafter, a classifier fusion strategy (CFS) is invoked as a postprocessing module, to combine the individual KNS classification results to derive a consensus decision. Essentially, the PRS is used to yield computational advantage, and the CFS, in turn, is used to compensate for the decreased efficiency caused by the data set division. Our experimental results demonstrate that the proposed mechanism significantly reduces the prototype extraction time as well as the computation time without sacrificing the classification accuracy. The results especially demonstrate a significant computational advantage for large data sets within a parallel processing philosophy.
Keywords :
data reduction; feature extraction; matrix algebra; pattern classification; principal component analysis; sensor fusion; set theory; classifier fusion strategy; computation time reduction; consensus decision; data set division; feature space extraction; kernel based nonlinear subspace methods; kernel matrix; large data sets; postprocessing module; preprocessing module; principal component directions; prototype extraction time; prototype reduction scheme; sample data points; subsets; Concurrent computing; Data mining; Eigenvalues and eigenfunctions; Kernel; Optimization methods; Parallel processing; Principal component analysis; Prototypes; Senior members; Vectors; Index Terms- Kernel Principal Component Analysis (kPCA); classifier fusion strategies (CFS).; kernel-based nonlinear subspace (KNS) method; prototype reduction schemes (PRS); Algorithms; Arrhythmias, Cardiac; Artificial Intelligence; Cluster Analysis; Computer Simulation; Diagnosis, Computer-Assisted; Humans; Information Storage and Retrieval; Models, Biological; Models, Statistical; Nonlinear Dynamics; Pattern Recognition, Automated; Pilot Projects; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2005.60