• DocumentCode
    806256
  • Title

    Toward efficient multifeature query processing

  • Author

    Jagadish, H.V. ; Ooi, Beng Chin ; Shen, Heng Tao ; Tan, Kian-Lee

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
  • Volume
    18
  • Issue
    3
  • fYear
    2006
  • fDate
    3/1/2006 12:00:00 AM
  • Firstpage
    350
  • Lastpage
    362
  • Abstract
    In many advanced applications, data are described by multiple high-dimensional features. Moreover, different queries may weight these features differently; some may not even specify all the features. In this paper, we propose our solution to support efficient query processing in these applications. We devise a novel representation that compactly captures f features into two components. The first component is a 2D vector that reflects a distance range (minimum and maximum values) of the f features with respect to a reference point (the center of the space) in a metric space and the second component is a bit signature, with two bits per dimension, obtained by analyzing each feature´s descending energy histogram. This representation enables two levels of filtering: the first component prunes away points that do not share similar distance ranges, while the bit signature filters away points based on the dimensions of the relevant features. Moreover, the representation facilitates the use of a single index structure to further speed up processing. We employ the classical B+-tree for this purpose. We also propose a KNN search algorithm that exploits the access orders of critical dimensions of highly selective features and partial distances to prune the search space more effectively. Our extensive experiments on both real-life and synthetic data sets show that the proposed solution offers significant performance advantages over sequential scan and retrieval methods using single and multiple VA-files.
  • Keywords
    database indexing; query processing; tree searching; trees (mathematics); 2D vector; B+-tree; KNN search algorithm; bit signature; index structure; multifeature query processing; Application software; Computer Society; Energy capture; Filtering; Functional analysis; Histograms; Indexing; Information retrieval; Nearest neighbor searches; Query processing; Index Terms- Multifeature; high-dimensional; indexing; query processing; weighted query.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2006.51
  • Filename
    1583584