Title :
Association of Audio and Video Segmentations for Automatic Person Indexing
Author :
El Khoury, Elie ; Jaffré, Gaël ; Pinquier, Julien ; Sénac, Christine
Author_Institution :
Paul Sabatier Univ., Toulouse
Abstract :
In the audiovisual indexing context, we propose and experiment a method that, from an audio speaker segmentation and a video costume segmentation made on an audiovisual document, makes an automatic association between each voice and the images containing the corresponding visual person. This association can be used as a preprocessing step for existing applications like person identification systems. The first step consists in fusing, without any a priori knowledge, the two indexes produced by audio and video segmentations, in order to make the information brought by each of them more robust. Evaluation is done on a corpus composed of French TV broadcasts. If both audio and video streams are correctly segmented, this automatic association yields excellent results. When the two streams are oversegmented, our system permits to detect the main persons in term of duration of appearance.
Keywords :
audio signal processing; biometrics (access control); document image processing; image recognition; image segmentation; indexing; speaker recognition; video signal processing; French TV broadcast; audio segmentation; audio speaker segmentation; audiovisual document analysis; audiovisual indexing; automatic person indexing; person identification system; video costume segmentation; video segmentation; Automatic speech recognition; Bayesian methods; Face detection; Face recognition; Image segmentation; Indexing; Loudspeakers; Robustness; Streaming media; TV broadcasting;
Conference_Titel :
Content-Based Multimedia Indexing, 2007. CBMI '07. International Workshop on
Conference_Location :
Bordeaux
Print_ISBN :
1-4244-1011-8
Electronic_ISBN :
1-4244-1011-8
DOI :
10.1109/CBMI.2007.385424