Title :
Reconfigurable templates for robust vehicle detection and classification
Author :
Lv, Yang ; Yao, Benjamin ; Wang, Yongtian ; Zhu, Song-Chun
Author_Institution :
Key Lab. of Photoelectronic Imaging Technol. & Syst., BIT, USA
Abstract :
In this paper, we learn a reconfigurable template for detecting vehicles and classifying their types. We adopt a popular design for the part based model that has one coarse template covering entire object window and several small high-resolution templates representing parts. The reconfigurable template can learn part configurations that capture the spatial correlation of features for a deformable part based model. The features of templates are Histograms of Gradients (HoG). In order to better describe the actual dimensions and locations of “parts” (i.e. features with strong spatial correlations), we design a dictionary of rectangular primitives of various sizes, aspect-ratios and positions. A configuration is defined as a subset of non-overlapping primitives from this dictionary. To learn the optimal configuration using SVM amounts, we need to find the subset of parts that minimize the regularized hinge loss, which leads to a non-convex optimization problem. We solve this problem by replacing the hinge loss with a negative sigmoid loss that can be approximately decomposed into losses (or negative sigmoid scores) of individual parts. In the experiment, we compare our method empirically with group lasso and a state of the art method [7] and demonstrate that models learned with our method outperform others on two computer vision applications: vehicle localization and vehicle model recognition.
Keywords :
computer vision; concave programming; image classification; learning (artificial intelligence); minimisation; object detection; object recognition; road vehicles; support vector machines; traffic engineering computing; SVM; actual parts dimensions; actual parts locations; computer vision applications; deformable part based model; feature spatial correlation; group lasso; high-resolution templates; histograms of gradients; negative sigmoid loss; nonconvex optimization problem; nonoverlapping primitives; object window; part configuration learning; reconfigurable template learning; rectangular primitive dictionary design; regularized hinge loss minimization; vehicle classification; vehicle detection; vehicle localization; vehicle model recognition; Computational modeling; Deformable models; Dictionaries; Feature extraction; Heuristic algorithms; Vectors; Vehicles;
Conference_Titel :
Applications of Computer Vision (WACV), 2012 IEEE Workshop on
Conference_Location :
Breckenridge, CO
Print_ISBN :
978-1-4673-0233-3
Electronic_ISBN :
1550-5790
DOI :
10.1109/WACV.2012.6163016