• DocumentCode
    2145601
  • Title

    Pattern Matching with Flexible Wildcards and Recurring Characters

  • Author

    Wang, Haiping ; Xie, Fei ; Hu, Xuegang ; Li, Peipei ; Wu, Xindong

  • Author_Institution
    Sch. of Comput. Sci. & Inf. Eng., Hefei Univ. of Technol., Hefei, China
  • fYear
    2010
  • fDate
    14-16 Aug. 2010
  • Firstpage
    782
  • Lastpage
    786
  • Abstract
    Pattern matching is an important task, which is widely used in many fields, such as information retrieval and bioinformatics. Recently, a much more flexible pattern matching problem with wildcards has been proposed. Chen et al. introduced local constraints, global constraints and the one-off condition into the task of pattern matching, and the most representative algorithm SAIL was designed. However, the performance of SAIL is not analyzed well, which affects its application. Therefore, this paper analyzes the performance of SAIL in-depth, and discovers that the matching result is closely related to the features of patterns. Meanwhile, the completeness of SAIL in the pattern matching with no-recurring characters is proved, and an improved algorithm, named RSAIL, is proposed for pattern matching with recurring tail characters. Extensive experiments demonstrate that RSAIL improves the number of matches by 2.2% compared to SAIL.
  • Keywords
    pattern matching; RSAIL; bioinformatics; flexible wildcards; information retrieval; pattern matching; recurring characters; representative algorithm; tail characters; Algorithm design and analysis; Approximation algorithms; Bioinformatics; Classification algorithms; Information retrieval; Pattern matching; USA Councils; SAIL; completeness; matching; wildcard;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing (GrC), 2010 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    978-1-4244-7964-1
  • Type

    conf

  • DOI
    10.1109/GrC.2010.156
  • Filename
    5576076