Title :
Dimensionality reduction for bio-medical spectra
Author :
Bowman, Christopher ; Baumgartner, Richard ; Somorjai, R.
Author_Institution :
Inst. for Biodiagnostics, Nat. Res. Council of Canada, Winnipeg, Man., Canada
Abstract :
The classification problem for high dimensional data (for example near infrared spectra of bio-fluids) is a challenging, cornerstone problem in bio-informatics. The problems in the field possess many measured, highly correlated variables, which typically come from digitization of continuous signals, and relatively few distinct samples, with the number of variables often far exceeding the number of observations. Fortunately, in practice, the data are often restricted or nearly restricted to a relatively low dimensional manifold in feature space. We will compare several techniques both linear and nonlinear for identifying this manifold, including local and global principal component analysis, and a novel implementation of the (nonlinear) Whitney reduction network. The intrinsic dimension of the data manifold will be verified through an independent validation set.
Keywords :
eigenvalues and eigenfunctions; feature extraction; infrared spectra; medical signal processing; principal component analysis; bio-fluids; bio-informatics; bio-medical spectra; classification problem; continuous signals; digitization; feature space; global principal component analysis; high dimensional data; independent validation set; local principal component analysis; near infrared spectra; nonlinear Whitney reduction network; Covariance matrix; Digital images; Fluid dynamics; Geometry; Image analysis; Image reconstruction; Neural networks; Numerical simulation; Principal component analysis;
Conference_Titel :
Electrical and Computer Engineering, 2002. IEEE CCECE 2002. Canadian Conference on
Print_ISBN :
0-7803-7514-9
DOI :
10.1109/CCECE.2002.1013096