• DocumentCode
    2115452
  • Title

    Effective image database search via dimensionality reduction

  • Author

    Dahl, Anders Bjorholm ; Aanaes, Dahl Henrik

  • Author_Institution
    Inf. DTU, Tech. Univ. of Denmark, Lyngby
  • fYear
    2008
  • fDate
    23-28 June 2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Image search using the bag-of-words image representation is investigated further in this paper. This approach has shown promising results for large scale image collections making it relevant for Internet applications. The steps involved in the bag-of-words approach are feature extraction, vocabulary building, and searching with a query image. It is important to keep the computational cost low through all steps. In this paper we focus on the efficiency of the technique. To do that we substantially reduce the dimensionality of the features by the use of PCA and addition of color. Building of the visual vocabulary is typically done using k-means. We investigate a clustering algorithm based on the leader follower principle (LF-clustering), in which the number of clusters is not fixed. The adaptive nature of LF-clustering is shown to improve the quality of the visual vocabulary using this. In the query step, features from the query image are assigned to the visual vocabulary. The dimensionality reduction enables us to do exact feature labeling using kD-tree, instead of approximate approaches normally used. Despite the dimensionality reduction to between 6 and 15 dimensions we obtain improved results compared to the traditional bag-of-words approach based on 128 dimensional SIFT feature and k-means clustering.
  • Keywords
    feature extraction; image representation; image retrieval; principal component analysis; visual databases; clustering algorithm; dimensionality reduction; feature extraction; image database searching; image representation; kD-tree; leader follower principle; principal component analysis; vocabulary building; Clustering algorithms; Computational efficiency; Feature extraction; Image databases; Image representation; Internet; Labeling; Large-scale systems; Principal component analysis; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition Workshops, 2008. CVPRW '08. IEEE Computer Society Conference on
  • Conference_Location
    Anchorage, AK
  • ISSN
    2160-7508
  • Print_ISBN
    978-1-4244-2339-2
  • Electronic_ISBN
    2160-7508
  • Type

    conf

  • DOI
    10.1109/CVPRW.2008.4562957
  • Filename
    4562957