DocumentCode
468401
Title
Accurate Classification of SAGE Data Based on Frequent Patterns of Gene Expression
Author
Tzanis, George ; Vlahavas, Ioannis
Author_Institution
Aristotle Univ. of Thessaloniki, Thessaloniki
Volume
1
fYear
2007
fDate
29-31 Oct. 2007
Firstpage
96
Lastpage
100
Abstract
In this paper we present a method for classifying accurately SAGE (serial analysis of gene expression) data. The high dimensionality of the data, namely the large number of features, in combination with the small number of samples poses a great challenge and demands more accurate and robust algorithms for classification. The prediction accuracy of the up to now proposed approaches is moderate. In our approach we exploit the associations among the expressions of genes in order to construct more accurate classifiers. For validating the effectiveness of our approach we experimented with two real datasets using numerous feature selection and classification algorithms. The results have shown that our approach improves significantly the classification accuracy, which reaches 99%.
Keywords
biology computing; classification; feature extraction; SAGE data classification; classification algorithm; feature selection; frequent pattern; serial analysis of gene expression; Accuracy; Biological information theory; Cancer; Classification algorithms; Data analysis; Data mining; Gene expression; Libraries; Proteins; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 2007. ICTAI 2007. 19th IEEE International Conference on
Conference_Location
Patras
ISSN
1082-3409
Print_ISBN
978-0-7695-3015-4
Type
conf
DOI
10.1109/ICTAI.2007.131
Filename
4410269
Link To Document