Title :
Hierarchical video clustering
Author :
Petrovic, Nemanja ; Jojic, Nebojsa ; Huang, Thomas S.
Author_Institution :
Inst. of Beckman, Illinois Univ., IL, USA
Date :
29 Sept.-1 Oct. 2004
Abstract :
We present a novel generative model for video that represents a video as a mixture of transformed video scenes. The learning procedure automatically clusters video frames into video scenes and objects. The learning algorithm is based on a hierarchical, on-line EM algorithm. The fast Fourier transform (FFT) is used for rapid computation in the E and M steps of the EM algorithm. We use the model to: 1. perform video clustering by grouping similar (up to translation and scale) video frames into clusters; 2. robustly stabilize video by inferring translation and scale intensity for each frame. We believe that video scene modeling of this kind is essential to bridging the "semantic gap" in video understanding. We illustrate this with several results that demonstrate both speed and accuracy.
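The abstract does not say how the FFT enters the E step, so the following is only a minimal sketch of one common way it can be used, not the authors' implementation. It assumes a 1-D "frame", circular (wrap-around) translations, a single cluster mean, and isotropic Gaussian noise with variance `sigma2`. Under those assumptions the log-likelihood of each shift equals the cross-correlation between frame and mean (up to a constant), so one FFT-based correlation scores all shifts at once. The names `fft`, `circular_cross_correlation`, `shift_posterior`, and `sigma2` are hypothetical, and scale is not modeled here.

```python
import cmath
import math


def fft(x, inverse=False):
    """Recursive radix-2 FFT (unnormalized); len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def circular_cross_correlation(frame, mean):
    """Return c with c[s] = sum_i frame[(i + s) % n] * mean[i], for every shift s."""
    n = len(frame)
    F = fft(frame)
    M = fft(mean)
    # Correlation theorem for real signals: C[k] = F[k] * conj(M[k]).
    prod = [f * m.conjugate() for f, m in zip(F, M)]
    # The inverse transform above is unnormalized, so divide by n here.
    return [v.real / n for v in fft(prod, inverse=True)]


def shift_posterior(frame, mean, sigma2=1.0):
    """Posterior over circular shifts, assuming a uniform prior on the shift.

    With isotropic Gaussian noise, ||frame - shifted(mean)||^2 differs across
    shifts only through the correlation term, so the log-likelihood of shift s
    is c[s] / sigma2 plus a constant.
    """
    corr = circular_cross_correlation(frame, mean)
    top = max(corr)  # subtract the maximum for numerical stability
    weights = [math.exp((c - top) / sigma2) for c in corr]
    total = sum(weights)
    return [w / total for w in weights]


# Usage: a frame that is the cluster mean shifted right by 3 samples.
mean = [0.0, 1.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0]
frame = [mean[(j - 3) % len(mean)] for j in range(len(mean))]
posterior = shift_posterior(frame, mean)
best_shift = posterior.index(max(posterior))
```

In this toy setting `best_shift` recovers the shift that was applied to the mean. Extending the sketch to 2-D frames would replace the 1-D transform with a 2-D one; the cost of scoring all translations then scales as N log N in the number of pixels rather than quadratically.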
Keywords :
fast Fourier transforms; learning (artificial intelligence); pattern clustering; video databases; video signal processing; fast Fourier transform; hierarchical video clustering; learning algorithm; online EM algorithm; semantic gap; transformed video scene; Bridges; Clustering algorithms; Fast Fourier transforms; Graphical models; Indexing; Information retrieval; Layout; Random variables; Robustness; Spatial databases;
Conference_Title :
Multimedia Signal Processing, 2004 IEEE 6th Workshop on
Print_ISBN :
0-7803-8578-0
DOI :
10.1109/MMSP.2004.1436583