DocumentCode :
3294202
Title :
Learning Global and Reconfigurable Part-Based Models for Object Detection
Author :
Xi Song ; Wu, Tianfu ; Xie, Yi ; Jia, Yunde
Author_Institution :
Beijing Lab. of Intell. Inf. Technol., Beijing Inst. of Technol., Beijing, China
fYear :
2012
fDate :
9-13 July 2012
Firstpage :
13
Lastpage :
18
Abstract :
This paper presents a method of learning global and reconfigurable part-based models (RPM) for object detection. Recently, deformable part-based model (DPM) is widely used. A DPM consists of a root node and a collection of part nodes, which is learned under the latent SVM formulation by treating part nodes as hidden variables. Although the configuration of parts (i.e., the shapes, sizes and locations of parts) plays a major role in improving performance of object detection, it has not been addressed well in the literature. In this paper, we propose RPM to tackle it. A dictionary of part types is defined by enumerating rectangular shapes of different aspect ratios and sizes given the whole lattice (often at twice resolution of the root node), and each part type has a set of part instances when placed in the lattice. So, the configuration space of parts is quantized by the part types and part instances, and then organized into a hierarchical And-Or directed a cyclic graph (AOG). The AOG consists of three types of nodes: terminal nodes (i.e., part instances), And-nodes (representing decompositions of a part instance into two smaller ones) and Or-nodes (representing alternative ways of decompositions). The globally optimal configuration in the AOG is solved using dynamic programming (DP) where the classification error rates of terminal nodes and And-nodes are used as their figures of merit. In experiments, we test our method on the 20 object categories in the PASCAL VOC2007 dataset and obtain comparable performance with state-of-the-art methods.
Keywords :
directed graphs; dynamic programming; error statistics; image classification; learning (artificial intelligence); object detection; support vector machines; AOG; And- nodes; DP; DPM; Or-nodes; PASCAL VOC2007 dataset; RPM; classification error rates; configuration space; deformable part-based model; dynamic programming; global part-based models learning; globally optimal configuration; hierarchical And-Or directed acyclic graph; latent SVM formulation; object categories; object detection; reconfigurable part-based models learning; rectangular shapes; root node; terminal nodes; Deformable models; Error analysis; Lattices; Object detection; Shape; Support vector machines; Training; Deformable Part-based Model; Dynamic Programming; Latent SVM; Part Configuration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2012 IEEE International Conference on
Conference_Location :
Melbourne, VIC
ISSN :
1945-7871
Print_ISBN :
978-1-4673-1659-0
Type :
conf
DOI :
10.1109/ICME.2012.32
Filename :
6298367
Link To Document :
بازگشت