• DocumentCode
    2080518
  • Title

    Approximate string search in spatial databases

  • Author

    Yao, Bin ; Li, Feifei ; Hadjieleftheriou, Marios ; Hou, Kun

  • Author_Institution
    Comput. Sci. Dept., Florida State Univ., Tallahassee, FL, USA
  • fYear
    2010
  • fDate
    1-6 March 2010
  • Firstpage
    545
  • Lastpage
    556
  • Abstract
    This work presents a novel index structure, MHR-tree, for efficiently answering approximate string match queries in large spatial databases. The MHR-tree is based on the R-tree augmented with the min-wise signature and the linear hashing technique. The min-wise signature for an index node u keeps a concise representation of the union of q-grams from strings under the sub-tree of u. We analyze the pruning functionality of such signatures based on set resemblance between the query string and the q-grams from the sub-trees of index nodes. MHR-tree supports a wide range of query predicates efficiently, including range and nearest neighbor queries. We also discuss how to estimate range query selectivity accurately. We present a novel adaptive algorithm for finding balanced partitions using both the spatial and string information stored in the tree. Extensive experiments on large real data sets demonstrate the efficiency and effectiveness of our approach.
  • Keywords
    tree data structures; visual databases; MHR-tree; index nodes subtrees; index structure; linear hashing technique; spatial databases string search; string match queries; Adaptive algorithm; Computer science; Costs; Dynamic programming; Indexes; Keyword search; Nearest neighbor searches; Spatial databases; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2010 IEEE 26th International Conference on
  • Conference_Location
    Long Beach, CA
  • Print_ISBN
    978-1-4244-5445-7
  • Electronic_ISBN
    978-1-4244-5444-0
  • Type

    conf

  • DOI
    10.1109/ICDE.2010.5447836
  • Filename
    5447836