Title :
High performance computational tools for Motif discovery
Author :
Baldwin, N.E. ; Collins, R.L. ; Langston, M.A. ; Symons, C.T. ; Leuze, M.R. ; Voy, B.H.
Author_Institution :
Dept. of Comput. Sci., Tennessee Univ., TN, USA
Abstract :
Summary form only given. We highlight a fruitful interplay between biology and computation. The sequencing of complete genomes from multiple organisms has revealed that most differences in organism complexity are due to elements of gene regulation that reside in the non protein coding portions of genes. Both within and between species, transcription factor binding sites and the proteins that recognize them govern the activity of cellular pathways that mediate adaptive responses and survival. Experimental identification of these regulatory elements is by nature a slow process. The availability of complete genomic sequences, however, opens the door for computational methods to predict binding sites and expedite our understanding of gene regulation at a genomic level. Just as with traditional experimental approaches, the computational identification of the molecular factors that control a gene´s expression level has been problematic. As a case in point, the identification of putative motifs is a challenging combinatorial task. For it, powerful new motif finding algorithms and high performance implementations are described. Heavy use is made of graph algorithms, some of which are exceedingly computationally intensive and involve the use of emergent mathematical methods. An approach to fully dynamic load balancing is developed in order to make effective use of highly parallel platforms.
Keywords :
biology computing; cellular biophysics; genetics; microorganisms; molecular biophysics; parallel processing; proteins; resource allocation; cellular pathway; computational identification; dynamic load balancing; gene expression level; gene regulation; genome; graph algorithm; high performance computational tool; highly parallel platforms; molecular factor; motif discovery; nonprotein coding portion; transcription factor binding site; Bioinformatics; Computational Intelligence Society; DNA; Gene expression; Genomics; High performance computing; Humans; Mice; Proteins; Sequences;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
DOI :
10.1109/IPDPS.2004.1303210