Title :
ParTriCluster: A Scalable Parallel Algorithm for Gene Expression Analysis
Author :
Araujo, R. ; Trielli, G. ; Orair, G. ; Meira, W. ; Ferreira, R. ; Guedes, D.
Author_Institution :
Dept. of Comput. Sci., Univ. Fed. de Minas Gerais
Abstract :
Analyzing gene expression patterns is becoming a highly relevant task in the bio informatics area. This analysis makes it possible to determine the behavior patterns of genes under various conditions, a fundamental information for treating diseases, among other applications. An advance in this area is the tricluster algorithm, which is the first algorithm capable of determining 3D clusters, that is, it determines clusters of sets of genes that behave similarly in a set of samples and set of time stamps. However, while biological experiments collect an increasing amount of data to be analyzed and correlated, the triclustering problem is NP-complete, and its parallelization seems to be an essential step towards obtaining feasible solutions. In this paper we propose and evaluate the implementation of a parallel version of the tricluster algorithm using the filter-labeled-stream paradigm supported by the Anthill parallel programming environment. The results show that our parallelization scales linearly with the data size. Further, the parallelization strategy is applicable to any depth-first searches
Keywords :
biology computing; computational complexity; genetics; parallel algorithms; parallel programming; pattern clustering; programming environments; tree searching; Anthill parallel programming environment; NP-complete problem; ParTriCluster scalable parallel algorithm; bioinformatics; depth-first searches; filter-labeled-stream paradigm; gene expression analysis; pattern clustering; Algorithm design and analysis; Clustering algorithms; Data analysis; Diseases; Gene expression; Informatics; Information analysis; Parallel algorithms; Parallel programming; Pattern analysis;
Conference_Titel :
Computer Architecture and High Performance Computing, 2006. SBAC-PAD '06. 18TH International Symposium on
Conference_Location :
Ouro Preto
Print_ISBN :
0-7695-2704-3
DOI :
10.1109/SBAC-PAD.2006.20