Title :
Multi-modal topic unit segmentation in videos using conditional random fields
Author :
Su Xu ; Bailan Feng ; Bo Xu
Author_Institution :
Interactive Digital Media Technol. Res. Center, Inst. of Autom., Beijing, China
Abstract :
In this paper a novel approach of video segmentation into topic units is presented. This approach is built upon the design in which topic unit segmentation is transformed into label identification problem by defining four types of shots that reveal semantic structure of it. To implement our algorithm, four middle-level features including shot difference signal, scene transition graph, shot theme and audio type are extracted to depict the label properties of each shot, and then CRFs model is employed to identify the labels sequence. CRFs model integrates context information, so it produces accurate results in topic unit segmentation. The proposed approach is verified by two types of data: documentary and news. Experiments on testing data set yield average 86% F-measure, which illustrates that the proposed method can accurately detect most topic units in different genres of programs.
Keywords :
feature extraction; graph theory; image segmentation; information resources; natural scenes; random processes; video signal processing; CRF model; audio type; conditional random fields; context information; documentary data:; label sequence identification; middle-level feature extraction; multimodal topic unit segmentation; news. data:; scene transition graph; shot difference signal; shot theme; video segmentation; Context; Feature extraction; Hidden Markov models; Prediction algorithms; Predictive models; Videos; Visualization; Conditional random field; Topic unit segmentation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6638062