DocumentCode :
2582683
Title :
Highly scalable and accurate seeds for subsequence alignment
Author :
Pol, Abhijit ; Kahveci, Tamer
Author_Institution :
Dept. of Comput. & Information Sci. & Eng., Florida Univ., Gainesville, FL, USA
fYear :
2005
fDate :
19-21 Oct. 2005
Firstpage :
27
Lastpage :
31
Abstract :
We propose a method for finding seeds for the local alignment of two nucleotide sequences. Our method uses randomized algorithms to find approximate seeds. We present a dynamic index to store the fingerprints of k-grams and a highly scalable and accurate (HSA) algorithm to incorporate randomization into process of seed generation. Experimental results show that our method produces better quality seeds with improved running time and memory usage compared to traditional non-spaced and spaced seeds. The presented algorithm scales very well with higher seed lengths while maintaining the quality and performance.
Keywords :
DNA; biology computing; molecular biophysics; molecular configurations; randomised algorithms; highly scalable accurate algorithms; k-grams; nucleotide sequences; randomized algorithms; seed generation; subsequence alignment; Bioinformatics; Costs; Data analysis; Databases; Dynamic programming; Fingerprint recognition; Heuristic algorithms; Information science; Scalability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
Print_ISBN :
0-7695-2476-1
Type :
conf
DOI :
10.1109/BIBE.2005.37
Filename :
1544445
Link To Document :
بازگشت