Title :
Audio, video and audio-visual signatures for short video clip detection: experiments on Trecvid2003
Author :
Senechal, Benjamin ; Pellerin, Denis ; Besacier, Laurent ; Simand, Isabelle ; Brès, Stéphane
Author_Institution :
LIS, Grenoble, France
Abstract :
In this paper, we present the association of audio and video signatures for short video clip detection. First, we present an audio signature based on the spectral flatness measure. Then we describe a spatio-temporal video signature, based on the evolution of gray level centroids over time. The major contribution of this work is the association of these two signatures in a so-called audiovisual signature by late integration of similarity measures obtained on both modalities. Our experiments conducted on a large video database (28 Gb/34 h extracted from TRECVID2003) show that our audio-visual signature is more robust than the audio-only or video-only signatures, and also permits better detection of video clips of shorter duration (about 2 seconds).
Keywords :
audio databases; audio signal processing; spatiotemporal phenomena; video databases; video retrieval; video signal processing; Trecvid2003; audio-visual signature; gray level centroid; spatio-temporal signature; video clip detection; video database; Audio databases; Automatic speech recognition; Content based retrieval; Data mining; Feature extraction; Fingerprint recognition; MPEG 7 Standard; Multimedia databases; Robustness; Weather forecasting;
Conference_Titel :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Print_ISBN :
0-7803-9331-7
DOI :
10.1109/ICME.2005.1521400