DocumentCode
2944784
Title
The String-to-Dictionary Matching Problem
Author
Klein, Shmuel T. ; Shapira, Dana
Author_Institution
Dept. of Comput. Sci., Bar Ilan Univ., Ramat Gan, Israel
fYear
2011
fDate
29-31 March 2011
Firstpage
143
Lastpage
152
Abstract
The String-to-Dictionary Matching Problem is defined, in which a string is searched for in all the possible concatenations of the elements of a given dictionary, with applications to compressed matching in variable to fixed length encodings, such as Tunstall´s. An algorithm based on suffix trees is suggested and experiments on natural language text are presented suggesting that compressed search might use less comparisons for long enough patterns, in spite of a potentially large number of encodings.
Keywords
data compression; dictionaries; encoding; information retrieval; natural language processing; string matching; compressed matching; fixed length encoding; natural language text; string searching; string-to-dictionary matching problem; Computer science; Data compression; Decoding; Dictionaries; Encoding; Indexes; Pattern matching; Compressed Matching; Suffix Trees; Tunstall;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference (DCC), 2011
Conference_Location
Snowbird, UT
ISSN
1068-0314
Print_ISBN
978-1-61284-279-0
Type
conf
DOI
10.1109/DCC.2011.21
Filename
5749472
Link To Document