Title :
A Nettree for Approximate Maximal Pattern Matching with Gaps and One-Off Constraint
Author :
Wu, Youxi ; Wu, Xindong ; Jiang, He ; Min, Fan
Author_Institution :
Sch. of Comput. Sci. & Software, Hebei Univ. of Technol., Tianjin, China
Abstract :
Recently, pattern matching with flexible gap constraints has attracted extensive attention especially in biological sequence analysis and mining patterns from sequences. An issue is to search Maximal Pattern Matching with Gaps and the One-Off Condition (MPMGOOC). Firstly, we introduce the concept of MPMGOOC. In order to solve the problem, we propose some special concepts of Nettree which is different from a tree in that a node may have more than one parent. Based on Nettree, an algorithm named Heuristic Search Occurrence (HSO) is proposed. The space and time complexities of the algorithm are O(W*m*n) and O(W*n*(n+m*m)) respectively, where m, n, and W are the length of pattern P, sequence S and the maximal gap respectively. The comparison results show that HSO achieves better performance than a state-of-the-art algorithm in most cases of the real-world biological data testing.
Keywords :
DNA; Internet; biology computing; data mining; pattern matching; tree data structures; HSO; Nettree; biological sequence analysis; gap constraints; heuristic search occurrence; maximal pattern matching; pattern mining; pattern sequence; Biological information theory; Complexity theory; Data mining; Heuristic algorithms; Pattern matching; Testing; Nettree; One-off condition; Pattern matching;
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2010 22nd IEEE International Conference on
Conference_Location :
Arras
Print_ISBN :
978-1-4244-8817-9
DOI :
10.1109/ICTAI.2010.81