DocumentCode
3205442
Title
Smith-Waterman Alignment of Huge Sequences with GPU in Linear Space
Author
de O. Sandes, Edans Flavius ; De Melo, Alba Cristina M A
Author_Institution
Dept. of Comput. Sci., Univ. of Brasilia (UnB), Brasilia, Brazil
fYear
2011
fDate
16-20 May 2011
Firstpage
1199
Lastpage
1211
Abstract
Cross-species chromosome alignments can reveal ancestral relationships and may be used to identify the peculiarities of the species. It is thus an important problem in Bioinformatics. So far, aligning huge sequences, such as whole chromosomes, with exact methods has been regarded as unfeasible, due to huge computing and memory requirements. However, high performance computing platforms such as GPUs are being able to change this scenario, making it possible to obtain the exact result for huge sequences in reasonable time. In this paper, we propose and evaluate a parallel algorithm that uses GPU to align huge sequences, executing the Smith-Waterman algorithm combined with Myers-Miller, with linear space complexity. In order to achieve that, we propose optimizations that are able to reduce significantly the amount of data processed and that enforce full parallelism most of the time. Using the GTX 285 Board, our algorithm was able to produce the optimal alignment between sequences composed of 33 Millions of Base Pairs (MBP) and 47 MBP in 18.5 hours.
Keywords
bioinformatics; cellular biophysics; coprocessors; parallel algorithms; GPU; GTX 285 Board; Myers-Miller algorithm; Smith-Waterman alignment; ancestral relationships; bioinformatics; cross-species chromosome alignments; high performance computing platform; linear space complexity; parallel algorithm; species peculiarity identification; Bioinformatics; Computer architecture; Graphics processing unit; Heuristic algorithms; Instruction sets; Mathematical model; Microprocessors;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International
Conference_Location
Anchorage, AK
ISSN
1530-2075
Print_ISBN
978-1-61284-372-8
Electronic_ISBN
1530-2075
Type
conf
DOI
10.1109/IPDPS.2011.114
Filename
6012857
Link To Document