Title :
Algorithms for bounded-error correlation of high dimensional data in microarray experiments
Author :
Koyuturk, Mehmet ; Grama, Ananth ; Szpankowski, Wojciechn
Author_Institution :
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN, USA
Abstract :
The problem of clustering continuous valued data has been well studied in literature. Its application to microarray analysis relies on such algorithms as k-means, dimensionality reduction techniques, and graph-based approaches for building dendrograms of sample data. In contrast, similar problems for discrete-attributed data are relatively unexplored. An instance of analysis of discrete-attributed data arises in detecting co-regulated samples in microarrays. In this paper, we present an algorithm and a software framework, PROXIMUS, for error-bounded clustering of high-dimensional discrete attributed datasets in the context of extracting co-regulated samples from microarray data. We show that PROXIMUS delivers outstanding performance in extracting accurate patterns of gene-expression.
Keywords :
arrays; biocomputing; correlation methods; genetics; pattern clustering; PROXIMUS; bounded-error correlation; building dendrograms; continuous valued data clustering problem; dimensionality reduction techniques; discrete-attributed data; error-bounded clustering; gene-expression; k-means algorithm; microarray analysis; software framework; Bioinformatics;
Conference_Titel :
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN :
0-7695-2000-6
DOI :
10.1109/CSB.2003.1227412