DocumentCode :
1783249
Title :
Finding Motifs in Biological Sequences Using the Micron Automata Processor
Author :
Roy, Indranil ; Aluru, Srinivas
Author_Institution :
Sch. of Comput. Sci. & Eng., Georgia Inst. of Technol., Atlanta, GA, USA
fYear :
2014
fDate :
19-23 May 2014
Firstpage :
415
Lastpage :
424
Abstract :
Finding approximately conserved sequences, called motifs, across multiple DNA or protein sequences is an important problem in computational biology. In this paper, we consider the (l, d) motif search problem of identifying one or more motifs of length l present in at least q of the n given sequences, with each occurrence differing from the motif in at most d substitutions. The problem is known to be NP-hard, and the largest solved instance reported to date is (26, 11). We propose a novel algorithm for the (l, d) motif search problem using streaming execution over a large set of Non-deterministic Finite Automata (NFA). This solution is designed to take advantage of the Micron Automata Processor, a new technology close to deployment that can simultaneously execute multiple NFA in parallel. We estimate the run-time for the (39, 18) and (40, 17) problem instances using the resources available within a single Automata Processor board. In addition to solving larger instances of the (l, d) motif search problem, the paper serves as a useful guide to solving problems using this new accelerator technology.
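The (l, d) motif definition in the abstract can be made concrete with a small checker. The sketch below is a brute-force illustration only, not the NFA-based algorithm the paper proposes. The function names (`hamming`, `occurs_within`, `is_ld_motif`) and the toy sequences are assumptions introduced here for illustration.

```python
def hamming(a, b):
    """Count the positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def occurs_within(motif, sequence, d):
    """Return True if some window of len(motif) in `sequence`
    differs from `motif` in at most d substitutions."""
    l = len(motif)
    return any(
        hamming(motif, sequence[i:i + l]) <= d
        for i in range(len(sequence) - l + 1)
    )


def is_ld_motif(motif, sequences, d, q):
    """Return True if `motif` occurs, within d substitutions,
    in at least q of the given sequences."""
    return sum(occurs_within(motif, s, d) for s in sequences) >= q


# Hypothetical toy input: n = 3 sequences, l = 4.
sequences = ["ACGTACGT", "TTACCTAA", "GGGGGGGG"]
found = is_ld_motif("ACGT", sequences, d=1, q=2)
```

Here the candidate "ACGT" appears exactly in the first sequence and with one substitution ("ACCT") in the second, so it qualifies for d = 1 and q = 2. A full search would have to consider all 4^l candidate strings for DNA, which is what makes large instances expensive.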
Keywords :
DNA; biology computing; computational complexity; finite automata; proteins; search problems; DNA; NFA; NP-hard; accelerator technology; approximately conserved sequences; biological sequences; computational biology; largest solved instance; micron automata processor; motif search problem; nondeterministic finite automata; protein sequences; streaming execution; Algorithm design and analysis; Automata; Finite element analysis; Hamming distance; Hardware; Search problems; Silicon; computational biology; finite automaton; graph algorithms; hardware acceleration; motif detection;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel and Distributed Processing Symposium, 2014 IEEE 28th International
Conference_Location :
Phoenix, AZ
ISSN :
1530-2075
Print_ISBN :
978-1-4799-3799-8
Type :
conf
DOI :
10.1109/IPDPS.2014.51
Filename :
6877275