Title :
An improvement of the overlap complexity in the spaced seed searching problem between genomic DNAs
Author :
Phan-Thuan Do;Cam-Giang Tran-Thi
Author_Institution :
School of Information and Communication Technology, Hanoi University of Science and Technology, Vietnam
Abstract :
In homology search, finding optimal multiple spaced seeds for genomic DNA sequences is NP-hard, and even finding good ones is very difficult. The exponential-time algorithm PatternHunter uses optimal spaced seeds to increase both the sensitivity and the speed of homology search. The overlap complexity measure, based on the overlaps between hits of a multiple seed, is well correlated with sensitivity yet computable in polynomial time. Based on overlap complexity, we have improved polynomial-time algorithms to produce better multiple seeds. Our experimental results show that these improvements run significantly faster and produce higher-quality spaced seeds than previous algorithms in almost all test cases.
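The abstract does not define overlap complexity, so the following is only a minimal sketch under an assumed definition, the one commonly used in the spaced-seed literature: for two seeds written as 0/1 strings, slide one against the other, count the aligned pairs of 1s at each shift (sigma), and sum 2^sigma over all shifts; for a multiple seed, sum this value over all unordered pairs of seeds, each seed paired with itself included. The function names `pair_overlap_complexity` and `overlap_complexity` are hypothetical and are not taken from the paper.

```python
from itertools import combinations_with_replacement


def pair_overlap_complexity(s1: str, s2: str) -> int:
    """Assumed definition: sum of 2**sigma over every shift of s2 against s1,
    where sigma is the number of positions at which both seeds have a '1'."""
    total = 0
    # shift is the position in s1 that s2[0] is aligned with
    for shift in range(1 - len(s2), len(s1)):
        sigma = sum(
            1
            for j, c in enumerate(s2)
            if c == "1" and 0 <= shift + j < len(s1) and s1[shift + j] == "1"
        )
        total += 2 ** sigma
    return total


def overlap_complexity(seeds: list[str]) -> int:
    """Assumed definition for a multiple seed: sum over all unordered
    pairs of seeds, including each seed paired with itself."""
    return sum(
        pair_overlap_complexity(a, b)
        for a, b in combinations_with_replacement(seeds, 2)
    )


if __name__ == "__main__":
    print(overlap_complexity(["11", "101"]))
```

Each pair costs time proportional to the product of the two seed lengths, so the whole measure is polynomial in the size of the multiple seed, which is consistent with the polynomial-time claim in the abstract. Under this assumed definition, a lower value indicates fewer overlapping hits.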
Keywords :
"Sensitivity","Complexity theory","Heuristic algorithms","Computer science","Search problems","Bioinformatics","DNA"
Conference_Title :
Information and Computer Science (NICS), 2015 2nd National Foundation for Science and Technology Development Conference on
Print_ISBN :
978-1-4673-6639-7
DOI :
10.1109/NICS.2015.7302205