Title :
PRUNER: algorithms for finding monad patterns in DNA sequences
Author :
VijayaSatya, Ravi ; Mukheqee, A.
Author_Institution :
Central Florida Univ., Orlando, FL, USA
Abstract :
We present new algorithms for discovering monad patterns in DNA sequences. Monad patterns are of the form (l,d)-k, where l is the length of the pattern, d is the maximum number of mismatches allowed, and k is the minimum number of times the pattern is repeated in the given sample. The time-complexity of some of the best known algorithms to date is O(nt2ld|Σ|d2/), where t is the number of input sequences, and n is the length of each input sequence. The first algorithm that we present takes O(n2t2ld2/|Σ|d2/) and space O(ntld2/|Σ|d2/), and the second algorithm takes O(n3t3ld2/|Σ|d2/ time using O(ld2/|Σ|d2/) space. In practice, our algorithms have much better performance provided the d/l ratio is small.
Keywords :
DNA; biology computing; computational complexity; molecular biophysics; DNA sequences; PRUNER; mismatch pattern; monad patterns; time complexity; Bioinformatics; Computational biology; Computer science; DNA; Genetic mutations; Graph theory; NP-complete problem; Sequences;
Conference_Titel :
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN :
0-7695-2194-0
DOI :
10.1109/CSB.2004.1332537