DocumentCode
680201
Title
StemFinder: An efficient algorithm for searching motif stems over large alphabets
Author
Qiang Yu ; Hongwei Huo ; Vitter, Jeffrey Scott ; Jun Huan ; Nekrich, Yakov
Author_Institution
Sch. of Comput. Sci. & Technol., Xidian Univ., Xi´an, China
fYear
2013
fDate
18-21 Dec. 2013
Firstpage
473
Lastpage
476
Abstract
Motif stem search (MSS) is a recent motif search problem to search motifs on large-alphabet inputs. A motif stem is an l-length string with some wildcards. The goal of the MSS problem is to find a set of stems that represents a superset of all (l, d) motifs present in the input sequences. The three main contributions of this paper are as follows: (1) We build motif stem representation more precisely by using regular expressions. (2) We give a new method for generating all possible motif stems. (3) We propose an efficient algorithm, called StemFinder, for solving the MSS problem. Compared with the previous algorithms, StemFinder runs much faster and first solves the (17, 8), (19, 9) and (21, 10) challenging instances on protein sequences; moreover, StemFinder reports fewer stems representing a smaller superset of all (l, d) motifs.
Keywords
molecular biophysics; molecular configurations; proteins; StemFinder algorithm; l-length string; large-alphabet inputs; motif stem representation; motif stem search; protein sequences; Algorithm design and analysis; Bioinformatics; DNA; Radiation detectors; Search problems; Silicon; Time complexity; Planted motif search; exact algorithms; motif stem search; regular expressions;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location
Shanghai
Type
conf
DOI
10.1109/BIBM.2013.6732539
Filename
6732539
Link To Document