DocumentCode
3672149
Title
End-to-end integration of a Convolutional Network, Deformable Parts Model and non-maximum suppression
Author
Li Wan;David Eigen;Rob Fergus
Author_Institution
Dept. of Computer Science, Courant Institute, New York University, 251 Mercer Street, 10012, USA
fYear
2015
fDate
6/1/2015 12:00:00 AM
Firstpage
851
Lastpage
859
Abstract
Deformable Parts Models and Convolutional Networks each have achieved notable performance in object detection. Yet these two approaches find their strengths in complementary areas: DPMs are well-versed in object composition, modeling fine-grained spatial relationships between parts; likewise, ConvNets are adept at producing powerful image features, having been discriminatively trained directly on the pixels. In this paper, we propose a new model that combines these two approaches, obtaining the advantages of each. We train this model using a new structured loss function that considers all bounding boxes within an image, rather than isolated object instances. This enables the non-maximal suppression (NMS) operation, previously treated as a separate post-processing stage, to be integrated into the model. This allows for discriminative training of our combined Convnet + DPM + NMS model in end-to-end fashion. We evaluate our system on PASCAL VOC 2007 and 2011 datasets, achieving competitive results on both benchmarks.
Keywords
"Training","Feature extraction","Deformable models","Convolution","Computational modeling","Detectors","Context"
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2015.7298686
Filename
7298686
Link To Document