Title :
PartBook for image parsing
Author :
Yang, Kuiyuan ; Zhang, Lei ; Rui, Yong ; Zhang, Hong-Jiang
Author_Institution :
Dept. of Autom., Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
Effective image parsing needs a representation that is both selective (to inter-class variations) and invariant (to intra-class variations). CodeBook from bag-of-visual-words representation addresses the invariance, and part-based models can potentially address the selectivity. However, existing part-based approaches either require expensive manual object-level labeling or make strong assumptions not applicable to real-world images. In this paper, we propose a PartBook approach that simultaneously overcomes the above two difficulties. Furthermore, we present an effective framework that integrates CodeBook and PartBook, which achieves both intra-class invariance and inter-class selectivity. Specifically, a set of candidate regions are first selected from heat map-like representations obtained by a SVM classifier trained for each category. Then the regions are clustered based on the dense matching-based similarity, and a part detector is learned from each cluster and further refined by utilizing a latent SVM. The learned PartBook summarizes the most representative mid-level patterns of each category, and can be readily used for image parsing tasks to identify not only objects but also different parts of an object. Extensive experimental results on real-world images show that the automatically learned parts are semantically meaningful, and demonstrate the effectiveness of ParkBook in image parsing tasks at different levels.
Keywords :
image classification; image matching; image representation; object recognition; pattern clustering; support vector machines; CodeBook; PartBook; SVM classifier; bag-of-visual-words representation; dense matching-based similarity; heat map-like representations; image parsing; inter-class selectivity; inter-class variation; intra-class invariance; intra-class variation; invariant representation; manual object-level labeling; mid-level patterns; object identification; part detector; part-based models; region clustering; selective representation; Detectors; Heating; Optimization; Support vector machines; Training; Vectors; Visualization;
Conference_Titel :
Computer Vision and Pattern Recognition Workshops (CVPRW), 2012 IEEE Computer Society Conference on
Conference_Location :
Providence, RI
Print_ISBN :
978-1-4673-1611-8
Electronic_ISBN :
2160-7508
DOI :
10.1109/CVPRW.2012.6239169