DocumentCode :
3230076
Title :
Model-based clustering with genes expression dynamics for time-course gene expression data
Author :
Wu, Fang-Xiang ; Zhang, Wen-Jun ; Kusalik, Anthony J.
Author_Institution :
Div. of Biomed. Eng., Saskatchewan Univ., Saskatoon, Sask., Canada
fYear :
2004
fDate :
19-21 May 2004
Firstpage :
267
Lastpage :
274
Abstract :
Microarray technologies are emerging as a promising tool for genomic studies. A huge body of time-course gene expression data has been, and will continue to be, produced by microarray experiments. Such gene expression data contain important information and have proven useful in medical diagnosis, treatment, and drug design. The challenge now is how to analyze such data to extract that information. Cluster analysis has played an important role in analyzing time-course gene expression data. However, most clustering techniques do not take into consideration the inherent time dependence (dynamics) of time-course gene expression patterns. Accounting for the inherent dynamics of such data in cluster analysis should lead to higher-quality clustering. This paper presents a model-based clustering method for time-course gene expression data. The presented method uses Markov chain models (MCMs) to account for the inherent dynamics of time-course gene expression patterns and assumes that expression patterns in the same cluster were generated by the same MCM. For a given number of clusters, the presented method computes cluster models using an EM algorithm and an assignment of genes to these models that maximizes their posterior probabilities. Further, this study employs the average adjusted Rand index (AARI) to evaluate the quality of clustering. The improved performance of the presented method is demonstrated by comparison with the k-means method on a publicly available dataset.
Keywords :
DNA; Markov processes; biology computing; data mining; genetics; molecular biophysics; pattern clustering; Markov chain models; adjusted Rand index; cluster analysis; gene expression data; genes expression dynamics; model-based clustering; Bioinformatics; Clustering methods; Data analysis; Drugs; Genetic expression; Genomics; Information analysis; Medical diagnosis; Medical treatment; Pharmaceutical technology
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Bioinformatics and Bioengineering, 2004. BIBE 2004. Proceedings. Fourth IEEE Symposium on
Print_ISBN :
0-7695-2173-8
Type :
conf
DOI :
10.1109/BIBE.2004.1317353
Filename :
1317353