DocumentCode :
2133155
Title :
Distributed multidimensional suffix arrays for string search
Author :
Fellah, Abdelaziz ; Mawson, Ronald
Author_Institution :
Dept. of Math. & Comput. Sci., Lethbridge Univ., Alta., Canada
Volume :
2
fYear :
2003
fDate :
28-30 Aug. 2003
Firstpage :
792
Abstract :
Although the classical B-tree and its derivatives are space-efficient data structures for small string documents, they require many structures and an excessive number of pointer manipulations to process and search large documents efficiently. In this paper, we propose a new and conceptually powerful data structure, called multidimensional suffix arrays (MSA), for string search. This data structure combines the features of suffix arrays and multilevel structures into a new approach that is very amenable to implementation on both conventional and clustered computers. Algorithms for searching, updating, reorganizing, and distributing multidimensional suffix arrays are discussed. Moreover, MSA achieve a significant improvement over string B-trees.
Keywords :
data structures; search engines; search problems; workstation clusters; clustered computer; data structure; distributed multidimensional suffix array; multilevel structure; reorganizing algorithm; searching algorithm; string search; updating algorithm; Clustering algorithms; Data structures; Distributed computing; Genetics; Indexing; Information analysis; Multidimensional systems; Pattern analysis; Sorting; Tree data structures;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Communications, Computers and Signal Processing, 2003. PACRIM. 2003 IEEE Pacific Rim Conference on
Print_ISBN :
0-7803-7978-0
Type :
conf
DOI :
10.1109/PACRIM.2003.1235900
Filename :
1235900