• DocumentCode
    2417366
  • Title

    Non-uniform partition strategies for indexing high-dimensional data with different distributions

  • Author

    Wang, Ben ; Gan, Qiang

  • Author_Institution
    Dept. of Comput. Sci., Essex Univ., Colchester, UK
  • fYear
    2003
  • fDate
    10-12 Dec. 2003
  • Firstpage
    13
  • Lastpage
    20
  • Abstract
    Efficient high-dimensional data indexing algorithms are crucial for image retrieval in large datasets. One of the state-of-the-art indexing methods is vector approximation file (VA-file), which indexes high-dimensional data by filtering feature vectors so that only a small fraction of them are visited in the search process. The VA-file uses a partition strategy that divides the data space on every dimension to make each partition equally full and assigns a same number of bits to each dimension. However, the strategy is not efficient to image datasets where the number of different vector components (granularity) in each dimension is largely diverse. The first two partition strategies are implemented in a practical way according to the description from the original VA-file method. The other two nonuniform partition strategies are proposed to resolve the problems of reduplicate coordinates and uniform bits assignment for each dimension, which assign more bits to represent dimensions with more vector components. Experimental results have shown that these strategies largely improve the performance of the VA-file for nonuniform datasets in terms of query time and filtering efficiency.
  • Keywords
    database indexing; image retrieval; very large databases; visual databases; dimension granularity; feature vector filtering efficiency; high-dimensional data indexing algorithm; image dataset; image retrieval; nonuniform partition strategy; query time; uniform bit assignment; vector approximation file; Clustering algorithms; Computer science; Degradation; Delay; Filtering; Gallium nitride; Image retrieval; Indexing; Information retrieval; Partitioning algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Software Engineering, 2003. Proceedings. Fifth International Symposium on
  • Conference_Location
    Taichung, Taiwan
  • Print_ISBN
    0-7695-2031-6
  • Type

    conf

  • DOI
    10.1109/MMSE.2003.1254417
  • Filename
    1254417