Title :
Weighted bag of visual words for object recognition
Author :
San Biagio, Marco ; Bazzani, Loris ; Cristani, Matteo ; Murino, Vittorio
Author_Institution :
Pattern Anal. & Comput. Vision, Ist. Italiano di Tecnol., Genoa, Italy
Abstract :
Bag of Visual words (BoV) is one of the most successful strategy for object recognition, used to represent an image as a vector of counts using a learned vocabulary. This strategy assumes that the representation is built using patches that are either densely extracted or sampled from the images using feature detectors. However, the dense strategy captures also the noisy background information, whereas the feature detection strategy can lose important parts of the objects. In this paper we propose a solution in-between these two strategies, by densely extracting patches from the image, and weighting them accordingly to their salience. Intuitively, highly salient patches have an important role in describing an object, while those with low saliency are still taken with low emphasis, instead of discarding them. We embed this idea in the word encoding mechanism adopted in the BoV approaches. The technique is successfully applied to vector quantization and Fisher vector, on Caltech-101 and Caltech-256.
Keywords :
feature extraction; image representation; learning (artificial intelligence); object recognition; vector quantisation; BoV approaches; Caltech-101; Caltech-256; Fisher vector; dense strategy; feature detectors; highly salient patches; image representation; learned vocabulary; noisy background information; object recognition; vector quantization; weighted bag of visual words; Encoding; Feature extraction; Object recognition; Pipelines; Training; Vectors; Visualization; dictionary learning; feature weighting; object recognition; visual saliency;
Conference_Titel :
Image Processing (ICIP), 2014 IEEE International Conference on
Conference_Location :
Paris
DOI :
10.1109/ICIP.2014.7025553