Title :
Video Primal Sketch: A generic middle-level representation of video
Author :
Han, Zhi ; Xu, Zongben ; Zhu, Song-Chun
Author_Institution :
Inst. for Inf. & Syst. Sci., Xi´´an Jiaotong Univ., Xi´´an, China
Abstract :
This paper presents a middle-level video representation named Video Primal Sketch (VPS), which integrates two regimes of models: i) sparse coding model using static or moving primitives to explicitly represent moving corners, lines, feature points, etc., ii) FRAME/MRF model with spatio-temporal filters to implicitly represent textured motion, such as water and fire, by matching feature statistics, i.e. histograms. This paper makes three contributions: i) learning a dictionary of video primitives as parametric generative model; ii) studying the Spatio-Temporal FRAME (ST-FRAME) model for modeling and synthesizing textured motion; and iii) developing a parsimonious hybrid model for generic video representation. VPS selects the proper representation automatically and is compatible with high-level action representations. In the experiments, we synthesize a series of dynamic textures, reconstruct real videos and show varying VPS over the change of densities causing by the scale transition in videos.
Keywords :
feature extraction; image matching; image motion analysis; image texture; learning (artificial intelligence); video signal processing; FRAME-MRF model; feature statistics matching; generic middle-level video representation; moving primitives; parametric generative model; parsimonious hybrid model; sparse coding model; spatio-temporal FRAME model; spatio-temporal filters; static primitives; textured motion synthesis; video primal sketch; video primitives dictionary learning; Bismuth; Dictionaries; Dynamics; Encoding; Histograms; Image reconstruction; Tracking;
Conference_Titel :
Computer Vision (ICCV), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4577-1101-5
DOI :
10.1109/ICCV.2011.6126380