• DocumentCode
    667339
  • Title

    Stability of feature selection algorithms for classification in high-throughput genomics datasets

  • Author

    Moulos, Panagiotis ; Kanaris, Ioannis ; Bontempi, Gianluca

  • Author_Institution
    Inst. of Mol. Biol. & Genetics, BSRC Alexander Fleming, Vari, Greece
  • fYear
    2013
  • fDate
    10-13 Nov. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    A major goal of the application of Machine Learning techniques to high-throughput genomics data (e.g. DNA microarrays or RNA-Seq), is the identification of “gene signatures”. These signatures can be used to discriminate among healthy or disease states (e.g. normal vs cancerous tissue) or among different biological mechanisms, at the gene expression level. Thus, the literature is plenty of studies, where numerous feature selection techniques are applied, in an effort to reduce the noise and dimensionality of such datasets. However, little attention is given to the stability of these signatures, in cases where the original dataset is perturbed by adding, removing or simply resampling the original observations. In this article, we are assessing the stability of a set of well characterized public cancer microarray datasets, using five popular feature selection algorithms in the field of high-throughput genomics data analysis.
  • Keywords
    biological tissues; biology computing; cancer; feature selection; genetics; genomics; learning (artificial intelligence); pattern classification; sampling methods; DNA microarrays; RNA-Seq; biological mechanisms; cancerous tissue; dimensionality reduction; disease states; feature selection algorithm stability; gene expression level; gene signature identification; healthy states; high-throughput genomics datasets classification; machine learning techniques; noise reduction; normal tissue; observation addition; observation removal; observation resampling; public cancer microarray datasets; Bioinformatics; Cancer; Classification algorithms; Gene expression; Genomics; Stability criteria;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
  • Conference_Location
    Chania
  • Type

    conf

  • DOI
    10.1109/BIBE.2013.6701677
  • Filename
    6701677