Title :
New approaches to clustering microarray time-series data using multiple expression profile alignment
Author :
Subhani, Numanul ; Rueda, Luis ; Ngom, Alioune ; Burden, Conrad
Author_Institution :
Sch. of Comput. Sci., Univ. of Windsor, Windsor, ON, Canada
Abstract :
An important process in functional genomic studies is clustering microarray time-series data, where genes with similar expression profiles are expected to be functionally related. Clustering microarray time-series data via pairwise alignment of piecewise linear profiles has been recently introduced. In this paper, we propose a clustering approach based on a multiple profile alignment of natural cubic spline and piecewise linear representations of gene expression profiles. We combine these multiple alignment approaches with k-means. We ran our methods on a well-known data set of pre-clustered Saccharomyces cerevisiae gene expression profiles and a data set of 3315 Pseudomonas aeruginosa expression profiles. We assessed the validity of the resulting clusters and applied a c-nearest neighbor classifier for evaluating the performance of our approaches, obtaining accuracies of 89.51% and 86.12% respectively, on Saccharomyces cerevisiae data, and 90.90% and 93.71% accuracies for cubic spline and piecewise linear respectively on Pseudomonas aeruginosa data.
Keywords :
biology computing; genomics; pattern clustering; splines (mathematics); time series; Pseudomonas aeruginosa expression profiles; Saccharomyces cerevisiae gene expression profiles; c-nearest neighbor classifier; data clustering; functional genomic studies; gene expression profiles; microarray time-series data; multiple expression profile alignment; multiple profile alignment; natural cubic spline; piecewise linear profiles; Bayesian methods; Bioinformatics; Euclidean distance; Gene expression; Genomics; Piecewise linear techniques; Radio access networks; Self organizing feature maps; Spline; Time series analysis; Clustering; Cubic Spline; Gene Expression Profiles; Microarrays; Piece-wise Linear; Profile Alignment; Time-Series Data;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2010 IEEE Symposium on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4244-6766-2
DOI :
10.1109/CIBCB.2010.5510385