DocumentCode :
3087250
Title :
Speaker Segmentation of Interviews Using Integrated Video and Audio Change Detectors
Author :
Lagrange, Mathieu ; Martins, Luis Gustavo ; Teixeira, Luis F. ; Tzanetakis, George
Author_Institution :
Univ. of Victoria, Victoria
fYear :
2007
fDate :
25-27 June 2007
Firstpage :
219
Lastpage :
226
Abstract :
In this paper, we study the use of audio and visual cues to perform speaker segmentation of audiovisual recordings of formal meetings such as interviews, lectures, or courtroom sessions. The sole use of audio cues for such recordings can be ineffective due to low recording quality and high level of background noise. We propose to use additional cues from the video stream by exploiting the relative static locations of speakers among the scene. The experiments show that the combination of those multiple cues helps to identify more robustly the transitions among speakers.
Keywords :
audio recording; speech processing; video signal processing; video streaming; audio change detector; audio cues; audiovisual recording; formal meeting; speaker segmentation; video detector; video stream; visual cues; Acoustic signal detection; Audio recording; Background noise; Change detection algorithms; Detectors; Lagrangian functions; Layout; Loudspeakers; Speech; Video recording;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Content-Based Multimedia Indexing, 2007. CBMI '07. International Workshop on
Conference_Location :
Bordeaux
Print_ISBN :
1-4244-1011-8
Electronic_ISBN :
1-4244-1011-8
Type :
conf
DOI :
10.1109/CBMI.2007.385415
Filename :
4275078
Link To Document :
بازگشت