• DocumentCode
    1476474
  • Title

    Investigating the Minimum Required Number of Genes for the Classification of Neuromuscular Disease Microarray Data

  • Author

    Sakellariou, Argiris ; Sanoudou, Despina ; Spyrou, George

  • Author_Institution
    Biomed. Res. Found., Acad. of Athens, Athens, Greece
  • Volume
    15
  • Issue
    3
  • fYear
    2011
  • fDate
    5/1/2011 12:00:00 AM
  • Firstpage
    349
  • Lastpage
    355
  • Abstract
    The discovery of potential microarray markers, which will expedite molecular diagnosis/prognosis and provide reliable results to clinical decision-making and treatment selection for patients, is of paramount importance. Feature selection techniques, which aim at minimizing the dimensionality of the microarray data by keeping the most statistically significant genes, are a powerful approach toward this goal. In this paper, we investigate the minimum required subsets of genes, which best classify neuromuscular disease data. For this purpose, we implemented a methodology pipeline that facilitated the use of multiple feature selection methods and subsequent performance of data classification. Five feature selection methods on datasets from ten different neuromuscular diseases were utilized. Our findings reveal subsets of very small number of genes, which can successfully classify normal/disease samples. Interestingly, we observe that similar classification results may be obtained from different subsets of genes. The proposed methodology can expedite the identification of small gene subsets with high-classification accuracy that could ultimately be used in the genetics clinics for diagnostic, prognostic, and pharmacogenomic purposes.
  • Keywords
    diseases; genetics; genomics; medical administrative data processing; medical computing; molecular biophysics; neurophysiology; patient diagnosis; gene minimum required number; genetics clinics; methodology pipeline; multiple feature selection methods; neuromuscular disease microarray data classification; normal-disease samples; patient diagnostics; pharmacogenomic application; Accuracy; Biomarkers; Classification algorithms; Diseases; Neuromuscular; Pipelines; Probes; Feature selection; microarray data analysis; molecular diagnosis; neuromuscular diseases; Algorithms; Artificial Intelligence; Computational Biology; Gene Expression Profiling; Humans; Molecular Diagnostic Techniques; Neuromuscular Diseases; Oligonucleotide Array Sequence Analysis;
  • fLanguage
    English
  • Journal_Title
    Information Technology in Biomedicine, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1089-7771
  • Type

    jour

  • DOI
    10.1109/TITB.2011.2130531
  • Filename
    5735227