DocumentCode :
253905
Title :
PANDA: Pose Aligned Networks for Deep Attribute Modeling
Author :
Ning Zhang ; Paluri, Manohar ; Ranzato, Marc´Aurelio ; Darrell, Trevor ; Bourdev, Lubomir
fYear :
2014
fDate :
23-28 June 2014
Firstpage :
1637
Lastpage :
1644
Abstract :
We propose a method for inferring human attributes (such as gender, hair style, clothes style, expression, action) from images of people under large variation of viewpoint, pose, appearance, articulation and occlusion. Convolutional Neural Nets (CNN) have been shown to perform very well on large scale object recognition problems. In the context of attribute classification, however, the signal is often subtle and it may cover only a small part of the image, while the image is dominated by the effects of pose and viewpoint. Discounting for pose variation would require training on very large labeled datasets which are not presently available. Part-based models, such as poselets [4] and DPM [12] have been shown to perform well for this problem but they are limited by shallow low-level features. We propose a new method which combines part-based models and deep learning by training pose-normalized CNNs. We show substantial improvement vs. state-of-the-art methods on challenging attribute classification tasks in unconstrained settings. Experiments confirm that our method outperforms both the best part-based methods on this problem and conventional CNNs trained on the full bounding box of the person.
Keywords :
image classification; learning (artificial intelligence); neural nets; pose estimation; DPM; PANDA; appearance variation; articulation variation; attribute classification; convolutional neural nets; deep learning; full bounding box; inferring human attributes; labeled datasets; object recognition problems; occlusion variation; part-based models; people images; pose aligned networks for deep attribute modeling; pose estimation; pose variation; poselets; shallow low-level features; training pose-normalized CNN; unconstrained settings; view-point variation; Convolution; Feature extraction; Glass; Hair; Neural networks; Object recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
Type :
conf
DOI :
10.1109/CVPR.2014.212
Filename :
6909608
Link To Document :
بازگشت