DocumentCode
3672491
Title
Taking a deeper look at pedestrians
Author
Jan Hosang;Mohamed Omran;Rodrigo Benenson;Bernt Schiele
Author_Institution
Max Planck Institute for Informatics, Saarbrü
fYear
2015
fDate
6/1/2015 12:00:00 AM
Firstpage
4073
Lastpage
4082
Abstract
In this paper we study the use of convolutional neural networks (convnets) for the task of pedestrian detection. Despite their recent diverse successes, convnets historically underperform compared to other pedestrian detectors. We deliberately omit explicitly modelling the problem into the network (e.g. parts or occlusion modelling) and show that we can reach competitive performance without bells and whistles. In a wide range of experiments we analyse small and big convnets, their architectural choices, parameters, and the influence of different training data, including pretraining on surrogate tasks. We present the best convnet detectors on the Caltech and KITTI dataset. On Caltech our convnets reach top performance both for the Caltech1x and Caltech10x training setup. Using additional data at training time our strongest convnet model is competitive even to detectors that use additional data (optical flow) at test time.
Keywords
"Proposals","Training","Detectors","Training data","Neural networks","Image edge detection","Videos"
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2015.7299034
Filename
7299034
Link To Document