Title :
A contextual model for semantic video structuring
Author :
Janvier, Bruno ; Bruno, Eric ; Marchand-Maillet, Stephane ; Pun, Thierry
Author_Institution :
Comput. Sci. Dept., Univ. of Geneva, Geneva, Switzerland
Abstract :
The problem of semantic video structuring is vital for automated management of large video collections. The goal is to automatically extract from the raw data the inner structure of a video collection; so that a whole new range of applications to browse and search video collections can be derived out of this high-level segmentation. To reach this goal, we exploit techniques that consider the full spectrum of video content; it is fundamental to properly integrate technologies from the fields of computer vision, audio analysis, natural language processing and machine learning. In this paper, a multimodal feature vector providing a rich description of the audio, visual and text modalities is first constructed. Boosted Random Fields are then used to learn two types of relationships : between features and labels and between labels associated with various modalities for improved consistency of the results. The parameters of this enhanced model are found iteratively by using two successive stages of Boosting. We experimented using the TRECvid corpus and show results that validate the approach over existing studies.
Keywords :
image segmentation; random processes; vectors; video signal processing; video streaming; audio modality; boosted random fields; high-level segmentation; multimodal feature vector; semantic video structuring; text modality; visual modality; Context; Context modeling; Frequency measurement; Hidden Markov models; Semantics; Streaming media; Visualization;
Conference_Titel :
Signal Processing Conference, 2005 13th European
Conference_Location :
Antalya
Print_ISBN :
978-160-4238-21-1