Title :
Object localization based on discriminative visual words
Author :
Fang, Su-wen ; Qu, Yan-yun ; Chen, Cheng ; Song, Shu-yang
Author_Institution :
Dept. of Comput. Sci., Xiamen Univ., Xiamen, China
Abstract :
This paper aims at learning discriminative visual words for object localization. These visual words are different from those learned from the generic object recognition which usually contain the negative visual words located on the background. For the purpose of object localization, the approach requires that the positive discriminative visual words mostly lie in the foreground object and the negative ones lie in the background. We firstly rank the visual words by the following three methods: the SVM classifier, the foreground likelihood ratio and the mutual information. Then, we integrate the three ranking results in an optimal combinational way and select the discriminative visual words by maximizing the object hit rate. Moreover, a coarse-to-fine detection framework to locate the object is designed. In the first stage, a branch-and-bound scheme combined with the discriminative visual words is implemented to find candidate object regions. In the second stage, a sliding window classifier is used to find the object location. The experimental results demonstrate that the approach is effective and efficient, and superior to Efficient Subwindow Search scheme.
Keywords :
document image processing; learning (artificial intelligence); natural language processing; object recognition; support vector machines; SVM classifier; branch-and-bound scheme; coarse-to-fine detection framework; foreground likelihood ratio; generic object recognition; learning; mutual information; negative visual words; object localization; optimal combinational way; positive discriminative visual words; Abstracts; Argon; Cascade; Classifiers; Efficient subwindow search; Feature selection; Local features;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location :
Xian
Print_ISBN :
978-1-4673-1484-8
DOI :
10.1109/ICMLC.2012.6359510