Title :
Efficient approximate and dynamic matching of patterns using a labeling paradigm
Author :
Sahinalp, Süleyman Cenk ; Vishkin, Uzi
Author_Institution :
Maryland Univ., College Park, MD, USA
Abstract :
A key approach in string processing algorithmics has been the labeling paradigm which is based on assigning labels to some of the substrings of a given string. If these labels are chosen consistently, they can enable fast comparisons of substrings. Until the first optimal parallel algorithm for suffix tree construction was given by the authors in 1994 the labeling paradigm was considered not to be competitive with other approaches. They show that this general method is also useful for several central problems in the area of string processing: approximate string matching, dynamic dictionary matching, and dynamic text indexing. The approximate string matching problem deals with finding all substrings of a text which match a pattern “approximately”, i.e., with at most m differences. The differences can be in the form of inserted, deleted, or replaced characters. The text indexing problem deals with finding all occurrences of a pattern in a text, after the text is preprocessed. In the dynamic text indexing problem, updates to the text in the form of insertions and deletions of substrings are permitted. The dictionary matching problem deals with finding all occurrences of each pattern set of a set of patterns in a text, after the pattern set is preprocessed. In the dynamic dictionary matching problem, insertions and deletions of patterns to the pattern set are permitted
Keywords :
computational complexity; indexing; parallel algorithms; pattern matching; string matching; tree data structures; approximate string matching; deleted characters; dynamic dictionary matching; dynamic text indexing; efficient approximate pattern matching; efficient dynamic pattern matching; inserted characters; labeling paradigm; optimal parallel algorithm; replaced characters; string processing algorithmics; substrings; suffix tree construction; Dictionaries; Educational institutions; Heuristic algorithms; Indexing; Labeling; Parallel algorithms; Pattern matching; Tree data structures;
Conference_Titel :
Foundations of Computer Science, 1996. Proceedings., 37th Annual Symposium on
Conference_Location :
Burlington, VT
Print_ISBN :
0-8186-7594-2
DOI :
10.1109/SFCS.1996.548491