DocumentCode :
468401
Title :
Accurate Classification of SAGE Data Based on Frequent Patterns of Gene Expression
Author :
Tzanis, George ; Vlahavas, Ioannis
Author_Institution :
Aristotle Univ. of Thessaloniki, Thessaloniki
Volume :
1
fYear :
2007
fDate :
29-31 Oct. 2007
Firstpage :
96
Lastpage :
100
Abstract :
In this paper we present a method for classifying accurately SAGE (serial analysis of gene expression) data. The high dimensionality of the data, namely the large number of features, in combination with the small number of samples poses a great challenge and demands more accurate and robust algorithms for classification. The prediction accuracy of the up to now proposed approaches is moderate. In our approach we exploit the associations among the expressions of genes in order to construct more accurate classifiers. For validating the effectiveness of our approach we experimented with two real datasets using numerous feature selection and classification algorithms. The results have shown that our approach improves significantly the classification accuracy, which reaches 99%.
Keywords :
biology computing; classification; feature extraction; SAGE data classification; classification algorithm; feature selection; frequent pattern; serial analysis of gene expression; Accuracy; Biological information theory; Cancer; Classification algorithms; Data analysis; Data mining; Gene expression; Libraries; Proteins; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 2007. ICTAI 2007. 19th IEEE International Conference on
Conference_Location :
Patras
ISSN :
1082-3409
Print_ISBN :
978-0-7695-3015-4
Type :
conf
DOI :
10.1109/ICTAI.2007.131
Filename :
4410269
Link To Document :
بازگشت