DocumentCode
3672599
Title
Pedestrian detection aided by deep learning semantic tasks
Author
Yonglong Tian;Ping Luo;Xiaogang Wang;Xiaoou Tang
Author_Institution
Department of Information Engineering, The Chinese University of Hong Kong, China
fYear
2015
fDate
6/1/2015 12:00:00 AM
Firstpage
5079
Lastpage
5087
Abstract
Deep learning methods have achieved great successes in pedestrian detection, owing to its ability to learn discriminative features from raw pixels. However, they treat pedestrian detection as a single binary classification task, which may confuse positive with hard negative samples (Fig.1 (a)). To address this ambiguity, this work jointly optimize pedestrian detection with semantic tasks, including pedestrian attributes (e.g. `carrying backpack´) and scene attributes (e.g. `vehicle´, `tree´, and `horizontal´). Rather than expensively annotating scene attributes, we transfer attributes information from existing scene segmentation datasets to the pedestrian dataset, by proposing a novel deep model to learn high-level features from multiple tasks and multiple data sources. Since distinct tasks have distinct convergence rates and data from different datasets have different distributions, a multi-task deep model is carefully designed to coordinate tasks and reduce discrepancies among datasets. Extensive evaluations show that the proposed approach outperforms the state-of-the-art on the challenging Caltech [9] and ETH [10] datasets where it reduces the miss rates of previous deep models by 17 and 5.5 percent, respectively.
Keywords
"Feature extraction","Semantics","Barium","Vehicles","Detectors","Machine learning","Data models"
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2015.7299143
Filename
7299143
Link To Document