DocumentCode :
2691422
Title :
Efficient filtration for similarity search with spaced k-mer neighbors
Author :
Li, Weiming ; Ma, Bin ; Zhang, Kaizhong
Author_Institution :
Dept. of Comput. Sci., Univ. of Western Ontario, London, ON, Canada
fYear :
2012
fDate :
4-7 Oct. 2012
Firstpage :
1
Lastpage :
6
Abstract :
In DNA and protein sequence similarity search, seeding (or filtration) has been widely used to trade search sensitivity with search speed. In this paper, a new seeding method, called spaced k-mer neighbors, is introduced to provide a more efficient tradeoff between the speed and sensitivity in protein similarity search. The new method pre-selects a set of spaced k-mers as neighbors, and uses the neighbors to detect hits between the query and database sequences. An efficient heuristic algorithm is proposed for the neighbor selection. We demonstrate that the method can improve the tradeoff efficiency over existing seeding methods.
Keywords :
DNA; bioinformatics; biological techniques; molecular biophysics; proteins; query processing; DNA similarity search; database sequences; filtration efficiency; heuristic algorithm; neighbor selection; protein sequence similarity search; query sequences; search sensitivity; search speed; seeding method; spaced k-mer neighbors; Amino acids; DNA; Databases; Humans; Proteins; Sensitivity; Training; homology search; similarity search; spaced seeds;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4673-2559-2
Electronic_ISBN :
978-1-4673-2558-5
Type :
conf
DOI :
10.1109/BIBM.2012.6392695
Filename :
6392695
Link To Document :
بازگشت