DocumentCode :
1018443
Title :
High-Performance Direct Pairwise Comparison of Large Genomic Sequences
Author :
Mueller, Christopher ; Dalkilic, Mehmet M. ; Lumsdaine, Andrew
Author_Institution :
Dept. of Comput. Sci., Indiana Univ., Bloomington, IN
Volume :
17
Issue :
8
fYear :
2006
Firstpage :
764
Lastpage :
772
Abstract :
Many applications in comparative genomics lend themselves to implementations that take advantage of common high-performance features in modern microprocessors. However, the common suggestion that a data-parallel, multithreaded, or high-throughput implementation is possible often ignores the complexity of actually creating such software. In this paper, we present two parallel algorithms for a classic comparative genomics algorithm, the dot plot. First, we describe a data-parallel algorithm that achieves speedups of up to 14.4x over the sequential version for large genomic comparisons. Then, we use the new algorithm as the base for a coarse-grained parallel version, suitable for multiprocessor and cluster environments, that scales linearly with the number of processors. These speedups introduce the opportunity to perform full pairwise comparisons on entire genomes on a much larger scale than previously possible. We also present the experimental, model-driven approach used to develop the algorithm that allowed us to carefully study and evaluate implementation options and to fully understand the parameters affecting its performance
Keywords :
DNA; biology computing; genetics; multiprocessing systems; parallel algorithms; sequences; cluster environments; data-parallel algorithm; dot plot; genomic algorithm; genomic sequences; model-driven approach; multiprocessor; Application software; Bioinformatics; Clustering algorithms; Databases; Genomics; Helium; High performance computing; Microprocessors; Parallel algorithms; Vector processors; Altivec; Dot plot; comparative genomics; data-parallel; high-performance computing; pairwise comparison; performance measures.; sequence alignment; vector processor;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2006.104
Filename :
1652940
Link To Document :
بازگشت