DocumentCode
2145601
Title
Pattern Matching with Flexible Wildcards and Recurring Characters
Author
Wang, Haiping ; Xie, Fei ; Hu, Xuegang ; Li, Peipei ; Wu, Xindong
Author_Institution
Sch. of Comput. Sci. & Inf. Eng., Hefei Univ. of Technol., Hefei, China
fYear
2010
fDate
14-16 Aug. 2010
Firstpage
782
Lastpage
786
Abstract
Pattern matching is an important task, which is widely used in many fields, such as information retrieval and bioinformatics. Recently, a much more flexible pattern matching problem with wildcards has been proposed. Chen et al. introduced local constraints, global constraints and the one-off condition into the task of pattern matching, and the most representative algorithm SAIL was designed. However, the performance of SAIL is not analyzed well, which affects its application. Therefore, this paper analyzes the performance of SAIL in-depth, and discovers that the matching result is closely related to the features of patterns. Meanwhile, the completeness of SAIL in the pattern matching with no-recurring characters is proved, and an improved algorithm, named RSAIL, is proposed for pattern matching with recurring tail characters. Extensive experiments demonstrate that RSAIL improves the number of matches by 2.2% compared to SAIL.
Keywords
pattern matching; RSAIL; bioinformatics; flexible wildcards; information retrieval; pattern matching; recurring characters; representative algorithm; tail characters; Algorithm design and analysis; Approximation algorithms; Bioinformatics; Classification algorithms; Information retrieval; Pattern matching; USA Councils; SAIL; completeness; matching; wildcard;
fLanguage
English
Publisher
ieee
Conference_Titel
Granular Computing (GrC), 2010 IEEE International Conference on
Conference_Location
San Jose, CA
Print_ISBN
978-1-4244-7964-1
Type
conf
DOI
10.1109/GrC.2010.156
Filename
5576076
Link To Document