DocumentCode :
3601476
Title :
Multi-View and 3D Deformable Part Models
Author :
Pepik, Bojan ; Stark, Michael ; Gehler, Peter ; Schiele, Bernt
Author_Institution :
Max Planck Inst. for Inf., Saarbrucken, Germany
Volume :
37
Issue :
11
fYear :
2015
Firstpage :
2232
Lastpage :
2245
Abstract :
As objects are inherently 3D, they have been modeled in 3D in the early days of computer vision. Due to the ambiguities arising from mapping 2D features to 3D models, 3D object representations have been neglected and 2D feature-based models are the predominant paradigm in object detection nowadays. While such models have achieved outstanding bounding box detection performance, they come with limited expressiveness, as they are clearly limited in their capability of reasoning about 3D shape or viewpoints. In this work, we bring the worlds of 3D and 2D object representations closer, by building an object detector which leverages the expressive power of 3D object representations while at the same time can be robustly matched to image evidence. To that end, we gradually extend the successful deformable part model [1] to include viewpoint information and part-level 3D geometry information, resulting in several different models with different level of expressiveness. We end up with a 3D object model, consisting of multiple object parts represented in 3D and a continuous appearance model. We experimentally verify that our models, while providing richer object hypotheses than the 2D object models, provide consistently better joint object localization and viewpoint estimation than the state-of-the-art multi-view and 3D object detectors on various benchmarks (KITTI [2], 3D object classes [3], Pascal3D+ [4], Pascal VOC 2007 [5], EPFL multi-view cars [6]).
Keywords :
computer vision; image representation; object detection; solid modelling; 2D feature mapping; 2D feature-based models; 2D object representation; 3D deformable part model; 3D object model; 3D object representations; bounding box detection performance; computer vision; joint object localization; multiview model; object detection; part-level 3D geometry information; viewpoint estimation; Deformable models; Detectors; Feature extraction; Geometry; Object detection; Solid modeling; Three-dimensional displays; 3D object models; Object detection; deformable part models; structured output learning;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2015.2408347
Filename :
7053926
Link To Document :
بازگشت