DocumentCode :
680201
Title :
StemFinder: An efficient algorithm for searching motif stems over large alphabets
Author :
Qiang Yu ; Hongwei Huo ; Vitter, Jeffrey Scott ; Jun Huan ; Nekrich, Yakov
Author_Institution :
Sch. of Comput. Sci. & Technol., Xidian Univ., Xi´an, China
fYear :
2013
fDate :
18-21 Dec. 2013
Firstpage :
473
Lastpage :
476
Abstract :
Motif stem search (MSS) is a recent motif search problem to search motifs on large-alphabet inputs. A motif stem is an l-length string with some wildcards. The goal of the MSS problem is to find a set of stems that represents a superset of all (l, d) motifs present in the input sequences. The three main contributions of this paper are as follows: (1) We build motif stem representation more precisely by using regular expressions. (2) We give a new method for generating all possible motif stems. (3) We propose an efficient algorithm, called StemFinder, for solving the MSS problem. Compared with the previous algorithms, StemFinder runs much faster and first solves the (17, 8), (19, 9) and (21, 10) challenging instances on protein sequences; moreover, StemFinder reports fewer stems representing a smaller superset of all (l, d) motifs.
Keywords :
molecular biophysics; molecular configurations; proteins; StemFinder algorithm; l-length string; large-alphabet inputs; motif stem representation; motif stem search; protein sequences; Algorithm design and analysis; Bioinformatics; DNA; Radiation detectors; Search problems; Silicon; Time complexity; Planted motif search; exact algorithms; motif stem search; regular expressions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location :
Shanghai
Type :
conf
DOI :
10.1109/BIBM.2013.6732539
Filename :
6732539
Link To Document :
بازگشت