• DocumentCode
    1639670
  • Title

    Genetic algorithm based feature selection for mass spectrometry data

  • Author

    Li, Yifeng ; Liu, Yihui ; Bai, Li

  • Author_Institution
    Sch. of Inf. Sci. & Technol., Inst. of Intell. Inf. Process., Jinan
  • fYear
    2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Mass spectrometry technique is a revolutionary tool for diagnosing early stage cancer by analyzing protein mass spectra, and for detecting biomarkers. But because of the high dimensionality of the data, feature selection is a necessary procedure before classification and analysis. In this paper we present a genetic algorithm for feature selection for prostate protein mass spectrometry data. An elitism coupled with rank based stochastic universal sampling selection strategy, uniform crossover operation, and a uniform mutation with adaptive mutation rate strategy are used. Two fitness functions are defined for the genetic algorithm: one is a multivariate filter measurement and the other is a wrapper measurement. Our experiments show that the wrapper-based genetic algorithm outperforms all the other feature selection methods presented here. The multivariate filter-based genetic algorithm also yields better performance than transformed methods, sequential selection methods, and univariate filter methods.
  • Keywords
    biological organs; biomedical measurement; cancer; feature extraction; genetic algorithms; mass spectroscopic chemical analysis; medical diagnostic computing; molecular biophysics; pattern classification; proteins; spectroscopy computing; stochastic processes; tumours; adaptive mutation rate strategy; biomarker detection; early stage cancer diagnosis; feature selection; multivariate filter measurement; pattern classification; prostate protein mass spectrometry technique; protein mass spectra; stochastic universal sampling selection strategy; uniform crossover operation; uniform mutation; wrapper measurement; wrapper-based genetic algorithm; Biomarkers; Cancer; Classification algorithms; Filters; Genetic algorithms; Genetic mutations; Information processing; Information science; Mass spectroscopy; Proteins;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    BioInformatics and BioEngineering, 2008. BIBE 2008. 8th IEEE International Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-1-4244-2844-1
  • Electronic_ISBN
    978-1-4244-2845-8
  • Type

    conf

  • DOI
    10.1109/BIBE.2008.4696664
  • Filename
    4696664