DocumentCode
2515679
Title
Inexact Local Alignment Search over Suffix Arrays
Author
Ghodsi, Mohammadreza ; Pop, Mihai
Author_Institution
Dept. of Comput. Sci., Univ. of Maryland, College Park, MD, USA
fYear
2009
fDate
1-4 Nov. 2009
Firstpage
83
Lastpage
87
Abstract
We describe an algorithm for finding approximate seeds for DNA homology searches. In contrast to previous algorithms that use exact or spaced seeds, our approximate seeds may contain insertions and deletions. We present a generalized heuristic for finding such seeds efficiently and prove that the heuristic does not affect sensitivity. We show how to adapt this algorithm to work over the memory efficient suffix array with provably minimal overhead in running time. We demonstrate the effectiveness of our algorithm on two tasks: whole genome alignment of bacteria and alignment of the DNA sequences of 177 genes that are orthologous in human and mouse. We show our algorithm achieves better sensitivity and uses less memory than other commonly used local alignment tools.
Keywords
DNA; biology computing; genetics; genomics; microorganisms; molecular biophysics; DNA homology searches; DNA sequences; bacteria; generalized heuristic; genes; genome alignment; human; inexact local alignment search; local alignment tools; memory efficient suffix array; mouse; Bioinformatics; Computational biology; Costs; DNA; Educational institutions; Genomics; Organisms; Sensitivity and specificity; Sequences; USA Councils; inexact seeds; local alignment; suffix array;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine, 2009. BIBM '09. IEEE International Conference on
Conference_Location
Washington, DC
Print_ISBN
978-0-7695-3885-3
Type
conf
DOI
10.1109/BIBM.2009.25
Filename
5341855
Link To Document