DocumentCode
2133155
Title
Distributed multidimensional suffix arrays for string search
Author
Fellah, Abdelaziz ; Mawson, Ronald
Author_Institution
Dept. of Math. & Comput. Sci., Lethbridge Univ., Alta., Canada
Volume
2
fYear
2003
fDate
28-30 Aug. 2003
Firstpage
792
Abstract
Although the classical B-tree and its derivatives are space efficient data structures for small string documents, they require many structures and an excessive amount of pointer manipulations for efficiently processing and searching large documents. In this paper, we propose a new and conceptually powerful data structure, called multidimensional suffix arrays (MSA), for string search. This data structure combines the features of suffix arrays and multilevel structures into a new approach which is very amenable to implementation on both conventional and clustered computers. Algorithms for searching, updating, reorganizing, and distributing multidimensional suffix arrays are discussed. Moreover, MSA achieve a significant improvement over string B-trees.
Keywords
data structures; search engines; search problems; workstation clusters; clustered computer; data structure; distributed multidimensional suffix array; multilevel structure; reorganizing algorithm; searching algorithm; string search; updating algorithm; Clustering algorithms; Data structures; Distributed computing; Genetics; Indexing; Information analysis; Multidimensional systems; Pattern analysis; Sorting; Tree data structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications, Computers and signal Processing, 2003. PACRIM. 2003 IEEE Pacific Rim Conference on
Print_ISBN
0-7803-7978-0
Type
conf
DOI
10.1109/PACRIM.2003.1235900
Filename
1235900
Link To Document