Pedestrian detection aided by deep learning semantic tasks

Author

Yonglong Tian;Ping Luo;Xiaogang Wang;Xiaoou Tang

Author_Institution

Department of Information Engineering, The Chinese University of Hong Kong, China

fYear

2015

fDate

6/1/2015 12:00:00 AM

Firstpage

5079

Lastpage

5087

Abstract

Deep learning methods have achieved great successes in pedestrian detection, owing to its ability to learn discriminative features from raw pixels. However, they treat pedestrian detection as a single binary classification task, which may confuse positive with hard negative samples (Fig.1 (a)). To address this ambiguity, this work jointly optimize pedestrian detection with semantic tasks, including pedestrian attributes (e.g. `carrying backpack´) and scene attributes (e.g. `vehicle´, `tree´, and `horizontal´). Rather than expensively annotating scene attributes, we transfer attributes information from existing scene segmentation datasets to the pedestrian dataset, by proposing a novel deep model to learn high-level features from multiple tasks and multiple data sources. Since distinct tasks have distinct convergence rates and data from different datasets have different distributions, a multi-task deep model is carefully designed to coordinate tasks and reduce discrepancies among datasets. Extensive evaluations show that the proposed approach outperforms the state-of-the-art on the challenging Caltech [9] and ETH [10] datasets where it reduces the miss rates of previous deep models by 17 and 5.5 percent, respectively.

Keywords

"Feature extraction","Semantics","Barium","Vehicles","Detectors","Machine learning","Data models"

Publisher

ieee

Conference_Titel

Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on

Electronic_ISBN

1063-6919

Type

conf

DOI

10.1109/CVPR.2015.7299143

Filename

7299143