Title :
Multi-modality movie scene detection using Kernel Canonical Correlation Analysis
Author :
Guangyu Gao ; Huadong Ma
Author_Institution :
Beijing Key Lab. of Intell. Telecommun. Software & Multimedia, Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
Scene detection is the fundamental step for efficient accessing and browsing videos. In this paper, we propose to segment movie into scenes which utilizes fused visual and audio features. The movie is first segmented into shots by an accelerating algorithm, and the key frames are extracted later. While feature movies are often filmed in open and dynamic environments using moving cameras and have continuously changing contents, we focus on the association extraction of visual and audio features. Then, based on the Kernel Canonical Correlation Analysis (KCCA), all these features are fused for scene detection. Finally, spatial-temporal coherent shots construct the similarity graph which is partitioned to generate the scene boundaries. We conduct extensive experiments on several movies, and the results show that our approach can efficiently detect the scene boundaries with a satisfactory performance.
Keywords :
correlation methods; feature extraction; graph theory; image fusion; spatiotemporal phenomena; video cameras; video retrieval; KCCA; association extraction; audio features; continuously changing contents; dynamic environments; kernel canonical correlation analysis; key frame extraction; movie segmentation; moving cameras; multimodality movie scene detection; open environments; scene boundary generation; similarity graph; spatial-temporal coherent shots; video access; video browsing; visual features; Color; Correlation; Feature extraction; Histograms; Motion pictures; Videos; Visualization;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4