• DocumentCode
    16563
  • Title

    The BoND-Tree: An Efficient Indexing Method for Box Queries in Nonordered Discrete Data Spaces

  • Author

    Changqing Chen ; Watve, Alok ; Pramanik, Sarah ; Qiang Zhu

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Michigan State Univ., East Lansing, MI, USA
  • Volume
    25
  • Issue
    11
  • fYear
    2013
  • fDate
    Nov. 2013
  • Firstpage
    2629
  • Lastpage
    2643
  • Abstract
    Box queries (or window queries) are a type of query which specifies a set of allowed values in each dimension. Indexing feature vectors in the multidimensional Nonordered Discrete Data Spaces (NDDS) for efficient box queries are becoming increasingly important in many application domains such as genome sequence databases. Most of the existing work in this field targets the similarity queries (range queries and k-NN queries). Box queries, however, are fundamentally different from similarity queries. Hence, the same indexing schemes designed for similarity queries may not be efficient for box queries. In this paper, we present a new indexing structure specifically designed for box queries in the NDDS. Unique characteristics of the NDDS are exploited to develop new node splitting heuristics. For the BoND-tree, we also provide theoretical analysis to show the optimality of the proposed heuristics. Extensive experiments with synthetic data demonstrate that the proposed scheme is significantly more efficient than the existing ones when applied to support box queries in NDDSs. We also show effectiveness of the proposed scheme in a real-world application of primer design for genome sequence databases.
  • Keywords
    biology computing; genomics; indexing; query processing; BoND-Tree; NDDS; box queries; genome sequence databases; indexing method; indexing schemes; multidimensional nonordered discrete data spaces; node splitting heuristics; similarity queries; Bioinformatics; Filtering; Genomics; Indexing; Vectors; Box query; categorical data; indexing; nonordered discrete data;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2012.132
  • Filename
    6604398