DocumentCode
1713873
Title
A Nettree for Approximate Maximal Pattern Matching with Gaps and One-Off Constraint
Author
Wu, Youxi ; Wu, Xindong ; Jiang, He ; Min, Fan
Author_Institution
Sch. of Comput. Sci. & Software, Hebei Univ. of Technol., Tianjin, China
Volume
2
fYear
2010
Firstpage
38
Lastpage
41
Abstract
Recently, pattern matching with flexible gap constraints has attracted extensive attention especially in biological sequence analysis and mining patterns from sequences. An issue is to search Maximal Pattern Matching with Gaps and the One-Off Condition (MPMGOOC). Firstly, we introduce the concept of MPMGOOC. In order to solve the problem, we propose some special concepts of Nettree which is different from a tree in that a node may have more than one parent. Based on Nettree, an algorithm named Heuristic Search Occurrence (HSO) is proposed. The space and time complexities of the algorithm are O(W*m*n) and O(W*n*(n+m*m)) respectively, where m, n, and W are the length of pattern P, sequence S and the maximal gap respectively. The comparison results show that HSO achieves better performance than a state-of-the-art algorithm in most cases of the real-world biological data testing.
Keywords
DNA; Internet; biology computing; data mining; pattern matching; tree data structures; HSO; Nettree; biological sequence analysis; gap constraints; heuristic search occurrence; maximal pattern matching; pattern mining; pattern sequence; Biological information theory; Complexity theory; Data mining; Heuristic algorithms; Pattern matching; Testing; Nettree; One-off condition; Pattern matching;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence (ICTAI), 2010 22nd IEEE International Conference on
Conference_Location
Arras
ISSN
1082-3409
Print_ISBN
978-1-4244-8817-9
Type
conf
DOI
10.1109/ICTAI.2010.81
Filename
5671434
Link To Document