DocumentCode :
1748712
Title :
Prefix trees: new efficient data structures for matching strings of different lengths
Author :
Yazdani, Nasser ; Min, Paul S.
Author_Institution :
Erlang Technol. Inc., St. Louis, MO, USA
fYear :
2001
fDate :
2001
Firstpage :
76
Lastpage :
85
Abstract :
Prefix matching is an essential part of some applications. A well known application of prefix matching is layers 3 and 4 switching in TCP/IP protocols. It is assumed that there are strings of an alphabet Σ which are ordered. The strings may have different lengths and some may be prefixes of others. We introduce a simple scheme for comparing and sorting strings of different lengths first. Then, data is manipulated and the well-known tree structures are tuned to handle the prefix matching queries. A string prefix represents a data space that includes all strings for which it is a prefix. Using this concept, first, we propose a binary prefix tree and extended it to two m-way tree structures, static m-way prefix tree and dynamic m-way prefix tree later. The m-way trees have better search performance at the expense of some memory space. The static m-way prefix is more suitable for an environment with few data transactions and is expected to give better memory usage and create a more compact tree structure. The dynamic m-way prefix tree is a superset of B tree and is suited better for an environment with large transactions. All data structures share a common property: no data string can be at a higher level than its prefix in the data set. The proposed data structures are simple, well defined, easy to implement in hardware or software and efficient in terms of memory usage and search time
Keywords :
query processing; sorting; string matching; transaction processing; tree data structures; B tree; TCP/IP protocols; alphabet; binary prefix tree; compact tree structure; data space; data string; data structure; data structures; data transactions; dynamic m-way prefix tree; large transactions; m-way tree structures; memory space; memory usage; ordered strings; prefix matching; prefix matching queries; prefix trees; search performance; search time; sorting; static m-way prefix tree; string matching; superset; Application software; Data structures; Databases; Information filtering; Information filters; Internet; Routing; Search engines; TCPIP; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Engineering and Applications, 2001 International Symposium on.
Conference_Location :
Grenoble
Print_ISBN :
0-7695-1140-6
Type :
conf
DOI :
10.1109/IDEAS.2001.938073
Filename :
938073
Link To Document :
بازگشت