Title :
Image retrieval using accurate approximated inverse document frequency of geometry-preserving visual phrases
Author :
Wang, Fangyuan ; Zhang, Shuwu
Author_Institution :
High-Tech Innovation Center, Inst. of Autom., Beijing, China
Abstract :
The most popular approach for large scale image retrieval is to represent images using the bag-of-visual-word (BoV) model. Based on the typical BoV zhang et al. introduce the idea of geometry-preserving visual phrases (GVP) to encode spatial information [1]. Since GVP can only be generated in searching step, it´s impractical to know the inverse document frequency (idf) of GVP in advance. Zhang et al. define the idf of a GVP as the summation of idf weights of visual words in the GVP. But, this kind of approximation is prone to give a much larger value than its real one. In this paper, we propose to use the smallest idf of visual words in a GVP to approximate the idf of the GVP, while keep the efficiency of GVP searching process. Experiments on Oxford 5K and MIR FLICKER 1M datasets show that our approach can achieve better performance compared with GVP.
Keywords :
approximation theory; computational geometry; image representation; image retrieval; word processing; BoV model; GVP generation; GVP idf approximation; MIR FLICKER 1M dataset; Oxford 5K dataset; accurate approximated inverse document frequency; bag-of-visual-word model; geometry-preserving visual phrases; image representation; large-scale image retrieval; searching process; spatial information encoding; Accuracy; Approximation methods; Equations; Mathematical model; USA Councils; Visualization; Vocabulary;
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
DOI :
10.1109/ICALIP.2012.6376744