• DocumentCode
    2133155
  • Title

    Distributed multidimensional suffix arrays for string search

  • Author

    Fellah, Abdelaziz ; Mawson, Ronald

  • Author_Institution
    Dept. of Math. & Comput. Sci., Lethbridge Univ., Alta., Canada
  • Volume
    2
  • fYear
    2003
  • fDate
    28-30 Aug. 2003
  • Firstpage
    792
  • Abstract
    Although the classical B-tree and its derivatives are space efficient data structures for small string documents, they require many structures and an excessive amount of pointer manipulations for efficiently processing and searching large documents. In this paper, we propose a new and conceptually powerful data structure, called multidimensional suffix arrays (MSA), for string search. This data structure combines the features of suffix arrays and multilevel structures into a new approach which is very amenable to implementation on both conventional and clustered computers. Algorithms for searching, updating, reorganizing, and distributing multidimensional suffix arrays are discussed. Moreover, MSA achieve a significant improvement over string B-trees.
  • Keywords
    data structures; search engines; search problems; workstation clusters; clustered computer; data structure; distributed multidimensional suffix array; multilevel structure; reorganizing algorithm; searching algorithm; string search; updating algorithm; Clustering algorithms; Data structures; Distributed computing; Genetics; Indexing; Information analysis; Multidimensional systems; Pattern analysis; Sorting; Tree data structures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, Computers and signal Processing, 2003. PACRIM. 2003 IEEE Pacific Rim Conference on
  • Print_ISBN
    0-7803-7978-0
  • Type

    conf

  • DOI
    10.1109/PACRIM.2003.1235900
  • Filename
    1235900