Title :
Learning and matching multiscale template descriptors for real-time detection, localization and tracking
Author :
Lee, Taehee ; Soatto, Stefano
Author_Institution :
Comput. Sci. Dept., Univ. of California, Los Angeles, CA, USA
Abstract :
We describe a system to learn an object template from a video stream, and localize and track the corresponding object in live video. The template is decomposed into a number of local descriptors, thus enabling detection and tracking in spite of partial occlusion. Each local descriptor aggregates contrast invariant statistics (normalized intensity and gradient orientation) across scales, in a way that enables matching under significant scale variations. Low-level tracking during the training video sequence enables capturing object-specific variability due to the shape of the object, which is encapsulated in the descriptor. Salient locations on both the template and the target image are used as hypotheses to expedite matching.
Keywords :
gradient methods; image matching; image sequences; object detection; statistical analysis; target tracking; video signal processing; video streaming; contrast invariant statistics; gradient orientation; live video; local descriptors; multiscale template descriptors matching; normalized intensity; object template learn; object-specific variability; partial occlusion; real-time detection; real-time localization; real-time tracking; scale variations; target image; template image; training video sequence; video stream; Detectors; Histograms; Lighting; Robustness; Target tracking; Testing; Training;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Conference_Location :
Providence, RI
Print_ISBN :
978-1-4577-0394-2
DOI :
10.1109/CVPR.2011.5995453