• DocumentCode
    468401
  • Title

    Accurate Classification of SAGE Data Based on Frequent Patterns of Gene Expression

  • Author

    Tzanis, George ; Vlahavas, Ioannis

  • Author_Institution
    Aristotle Univ. of Thessaloniki, Thessaloniki
  • Volume
    1
  • fYear
    2007
  • fDate
    29-31 Oct. 2007
  • Firstpage
    96
  • Lastpage
    100
  • Abstract
    In this paper we present a method for classifying accurately SAGE (serial analysis of gene expression) data. The high dimensionality of the data, namely the large number of features, in combination with the small number of samples poses a great challenge and demands more accurate and robust algorithms for classification. The prediction accuracy of the up to now proposed approaches is moderate. In our approach we exploit the associations among the expressions of genes in order to construct more accurate classifiers. For validating the effectiveness of our approach we experimented with two real datasets using numerous feature selection and classification algorithms. The results have shown that our approach improves significantly the classification accuracy, which reaches 99%.
  • Keywords
    biology computing; classification; feature extraction; SAGE data classification; classification algorithm; feature selection; frequent pattern; serial analysis of gene expression; Accuracy; Biological information theory; Cancer; Classification algorithms; Data analysis; Data mining; Gene expression; Libraries; Proteins; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence, 2007. ICTAI 2007. 19th IEEE International Conference on
  • Conference_Location
    Patras
  • ISSN
    1082-3409
  • Print_ISBN
    978-0-7695-3015-4
  • Type

    conf

  • DOI
    10.1109/ICTAI.2007.131
  • Filename
    4410269