• DocumentCode
    1859810
  • Title

    Dimensionality reduction for bio-medical spectra

  • Author

    Bowman, Christopher ; Baumgartner, Richard ; Somorjai, R.

  • Author_Institution
    Inst. for Biodiagnostics, Nat. Res. Council of Canada, Winnipeg, Man., Canada
  • Volume
    2
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    1077
  • Abstract
    The classification problem for high dimensional data (for example near infrared spectra of bio-fluids) is a challenging, cornerstone problem in bio-informatics. The problems in the field possess many measured, highly correlated variables, which typically come from digitization of continuous signals, and relatively few distinct samples, with the number of variables often far exceeding the number of observations. Fortunately, in practice, the data are often restricted or nearly restricted to a relatively low dimensional manifold in feature space. We will compare several techniques both linear and nonlinear for identifying this manifold, including local and global principal component analysis, and a novel implementation of the (nonlinear) Whitney reduction network. The intrinsic dimension of the data manifold will be verified through an independent validation set.
  • Keywords
    eigenvalues and eigenfunctions; feature extraction; infrared spectra; medical signal processing; principal component analysis; bio-fluids; bio-informatics; bio-medical spectra; classification problem; continuous signals; digitization; feature space; global principal component analysis; high dimensional data; independent validation set; local principal component analysis; near infrared spectra; nonlinear Whitney reduction network; Covariance matrix; Digital images; Fluid dynamics; Geometry; Image analysis; Image reconstruction; Neural networks; Numerical simulation; Principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering, 2002. IEEE CCECE 2002. Canadian Conference on
  • ISSN
    0840-7789
  • Print_ISBN
    0-7803-7514-9
  • Type

    conf

  • DOI
    10.1109/CCECE.2002.1013096
  • Filename
    1013096