• DocumentCode
    1713873
  • Title

    A Nettree for Approximate Maximal Pattern Matching with Gaps and One-Off Constraint

  • Author

    Wu, Youxi ; Wu, Xindong ; Jiang, He ; Min, Fan

  • Author_Institution
    Sch. of Comput. Sci. & Software, Hebei Univ. of Technol., Tianjin, China
  • Volume
    2
  • fYear
    2010
  • Firstpage
    38
  • Lastpage
    41
  • Abstract
    Recently, pattern matching with flexible gap constraints has attracted extensive attention especially in biological sequence analysis and mining patterns from sequences. An issue is to search Maximal Pattern Matching with Gaps and the One-Off Condition (MPMGOOC). Firstly, we introduce the concept of MPMGOOC. In order to solve the problem, we propose some special concepts of Nettree which is different from a tree in that a node may have more than one parent. Based on Nettree, an algorithm named Heuristic Search Occurrence (HSO) is proposed. The space and time complexities of the algorithm are O(W*m*n) and O(W*n*(n+m*m)) respectively, where m, n, and W are the length of pattern P, sequence S and the maximal gap respectively. The comparison results show that HSO achieves better performance than a state-of-the-art algorithm in most cases of the real-world biological data testing.
  • Keywords
    DNA; Internet; biology computing; data mining; pattern matching; tree data structures; HSO; Nettree; biological sequence analysis; gap constraints; heuristic search occurrence; maximal pattern matching; pattern mining; pattern sequence; Biological information theory; Complexity theory; Data mining; Heuristic algorithms; Pattern matching; Testing; Nettree; One-off condition; Pattern matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2010 22nd IEEE International Conference on
  • Conference_Location
    Arras
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4244-8817-9
  • Type

    conf

  • DOI
    10.1109/ICTAI.2010.81
  • Filename
    5671434