Title :
Pyramid-Based Visual Tracking Using Sparsity Represented Mean Transform
Author :
Zhe Zhang ; Kin Hong Wong
Author_Institution :
Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong, Hong Kong, China
Abstract :
In this paper, we propose a robust method for visual tracking relying on mean shift, sparse coding and spatial pyramids. Firstly, we extend the original mean shift approach to handle orientation space and scale space and name this new method as mean transform. The mean transform method estimates the motion, including the location, orientation and scale, of the interested object window simultaneously and effectively. Secondly, a pixel-wise dense patch sampling technique and a region-wise trivial template designing scheme are introduced which enable our approach to run very accurately and efficiently. In addition, instead of using either holistic representation or local representation only, we apply spatial pyramids by combining these two representations into our approach to deal with partial occlusion problems robustly. Observed from the experimental results, our approach outperforms state-of-the-art methods in many benchmark sequences.
Keywords :
image coding; image representation; motion estimation; object tracking; wavelet transforms; holistic representation; local representation; mean shift approach; motion estimation; object window; orientation space handling; partial occlusion problems; pixelwise dense patch sampling technique; pyramid-based visual tracking; region wise trivial template designing scheme; scale space handling; sparse coding; sparsity represented mean transform; spatial pyramids; Dictionaries; Histograms; Kernel; Target tracking; Transforms; Visualization;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
DOI :
10.1109/CVPR.2014.160