• DocumentCode
    239452
  • Title

    Saliency map driven image retrieval combining the bag-of-words model and PLSA

  • Author

    Giouvanakis, Emmanouil ; Kotropoulos, Constantine

  • Author_Institution
    Dept. of Inf., Aristotle Univ. of Thessaloniki, Thessaloniki, Greece
  • fYear
    2014
  • fDate
    20-23 Aug. 2014
  • Firstpage
    280
  • Lastpage
    285
  • Abstract
    A new image retrieval system is proposed that combines the bag-of-words (BoW) model and Probabilistic Latent Semantic Analysis (PLSA). First, interest points on images are detected using the Hessian-Affine keypoint detector and Scale Invariant Feature Transform (SIFT) descriptors are computed. Graph-based visual saliency maps are then employed in order to detect and discard outliers in image descriptors. By doing so, SIFT features lying in non-salient regions can be deleted. All the remaining reliable feature descriptors are divided into a number of subsets and partial vocabularies are extracted for each of them. The final vocabulary used in the BoW model is obtained by the concatenating the partial vocabularies. The resulting BoW representations are weighted using the TF-IDF scheme. Finally, the PLSA is employed to perform a probabilistic mixture decomposition of the weighted BoW representations. Query expansion is demonstrated to improve the retrieval quality. Overall a 0.79 mean average precision is reported when the saliency filtering was applied on SIFTs and the BoW plus PLSA method was used.
  • Keywords
    graph theory; image representation; image retrieval; object detection; probability; BoW model; Hessian-Affine keypoint detector; PLSA method; SIFT descriptors; TF-IDF scheme; bag-of-words model; graph-based visual saliency maps; image descriptors; nonsalient regions; partial vocabulary extraction; probabilistic latent semantic analysis; probabilistic mixture decomposition; query expansion; reliable feature descriptors; saliency filtering; saliency map driven image retrieval; scale invariant feature transform; weighted BoW representations; Digital signal processing; Feature extraction; Image retrieval; Probabilistic logic; Vectors; Visualization; Vocabulary; bag of words; graph-based visual saliency; image retrieval; object retrieval; probabilistic latent semantic analysis; query expansion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing (DSP), 2014 19th International Conference on
  • Conference_Location
    Hong Kong
  • Type

    conf

  • DOI
    10.1109/ICDSP.2014.6900671
  • Filename
    6900671