DocumentCode :
1713873
Title :
A Nettree for Approximate Maximal Pattern Matching with Gaps and One-Off Constraint
Author :
Wu, Youxi ; Wu, Xindong ; Jiang, He ; Min, Fan
Author_Institution :
Sch. of Comput. Sci. & Software, Hebei Univ. of Technol., Tianjin, China
Volume :
2
fYear :
2010
Firstpage :
38
Lastpage :
41
Abstract :
Pattern matching with flexible gap constraints has recently attracted extensive attention, especially in biological sequence analysis and in mining patterns from sequences. One open problem is Maximal Pattern Matching with Gaps and the One-Off Condition (MPMGOOC). We first introduce the concept of MPMGOOC. To solve the problem, we propose several special concepts for the Nettree, a structure that differs from a tree in that a node may have more than one parent. Based on the Nettree, we propose an algorithm named Heuristic Search Occurrence (HSO). The space and time complexities of the algorithm are O(W*m*n) and O(W*n*(n+m*m)), respectively, where m is the length of pattern P, n is the length of sequence S, and W is the maximal gap. Comparison results show that HSO outperforms a state-of-the-art algorithm in most test cases on real-world biological data.
Keywords :
DNA; Internet; biology computing; data mining; pattern matching; tree data structures; HSO; Nettree; biological sequence analysis; gap constraints; heuristic search occurrence; maximal pattern matching; pattern mining; pattern sequence; Biological information theory; Complexity theory; Data mining; Heuristic algorithms; Pattern matching; Testing; Nettree; One-off condition; Pattern matching;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2010 22nd IEEE International Conference on
Conference_Location :
Arras
ISSN :
1082-3409
Print_ISBN :
978-1-4244-8817-9
Type :
conf
DOI :
10.1109/ICTAI.2010.81
Filename :
5671434