DocumentCode
239452
Title
Saliency map driven image retrieval combining the bag-of-words model and PLSA
Author
Giouvanakis, Emmanouil ; Kotropoulos, Constantine
Author_Institution
Dept. of Inf., Aristotle Univ. of Thessaloniki, Thessaloniki, Greece
fYear
2014
fDate
20-23 Aug. 2014
Firstpage
280
Lastpage
285
Abstract
A new image retrieval system is proposed that combines the bag-of-words (BoW) model and Probabilistic Latent Semantic Analysis (PLSA). First, interest points on images are detected using the Hessian-Affine keypoint detector and Scale Invariant Feature Transform (SIFT) descriptors are computed. Graph-based visual saliency maps are then employed in order to detect and discard outliers in image descriptors. By doing so, SIFT features lying in non-salient regions can be deleted. All the remaining reliable feature descriptors are divided into a number of subsets and partial vocabularies are extracted for each of them. The final vocabulary used in the BoW model is obtained by the concatenating the partial vocabularies. The resulting BoW representations are weighted using the TF-IDF scheme. Finally, the PLSA is employed to perform a probabilistic mixture decomposition of the weighted BoW representations. Query expansion is demonstrated to improve the retrieval quality. Overall a 0.79 mean average precision is reported when the saliency filtering was applied on SIFTs and the BoW plus PLSA method was used.
Keywords
graph theory; image representation; image retrieval; object detection; probability; BoW model; Hessian-Affine keypoint detector; PLSA method; SIFT descriptors; TF-IDF scheme; bag-of-words model; graph-based visual saliency maps; image descriptors; nonsalient regions; partial vocabulary extraction; probabilistic latent semantic analysis; probabilistic mixture decomposition; query expansion; reliable feature descriptors; saliency filtering; saliency map driven image retrieval; scale invariant feature transform; weighted BoW representations; Digital signal processing; Feature extraction; Image retrieval; Probabilistic logic; Vectors; Visualization; Vocabulary; bag of words; graph-based visual saliency; image retrieval; object retrieval; probabilistic latent semantic analysis; query expansion;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Signal Processing (DSP), 2014 19th International Conference on
Conference_Location
Hong Kong
Type
conf
DOI
10.1109/ICDSP.2014.6900671
Filename
6900671
Link To Document