Title :
Semi-supervised Learning of Feature Hierarchies for Object Detection in a Video
Author :
Yang Yang ; Guang Shu ; Shah, Mubarak
Author_Institution :
Center for Res. in Comput. Vision, Univ. of Central Florida, Orlando, FL, USA
Abstract :
We propose a novel approach to boost the performance of generic object detectors on videos by learning video-specific features using a deep neural network. The insight behind our proposed approach is that an object appearing in different frames of a video clip should share similar features, which can be learned to build better detectors. Unlike many supervised detector adaptation or detection-by-tracking methods, our method does not require any extra annotations or utilize temporal correspondence. We start with the high-confidence detections from a generic detector, then iteratively learn new video-specific features and refine the detection scores. In order to learn discriminative and compact features, we propose a new feature learning method using a deep neural network based on auto en-coders. It differs from the existing unsupervised feature learning methods in two ways: first it optimizes both discriminative and generative properties of the features simultaneously, which gives our features better discriminative ability, second, our learned features are more compact, while the unsupervised feature learning methods usually learn a redundant set of over-complete features. Extensive experimental results on person and horse detection show that significant performance improvement can be achieved with our proposed method.
Keywords :
feature extraction; neural nets; object detection; unsupervised learning; video signal processing; auto encoders; compact features; deep neural network; discriminative feature properties; feature hierarchies; feature learning method; generative feature properties; generic object detectors; high-confidence detection; horse detection; iterative learning; object detection scores; person detection; semisupervised learning; unsupervised feature learning method; video clip; video-specific feature learning; Detectors; Feature extraction; Image color analysis; Learning systems; Neural networks; Object detection; Training; Object detection; deep learning; feature learning; video analysis;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location :
Portland, OR
DOI :
10.1109/CVPR.2013.216