DocumentCode :
698003
Title :
Multimodal speaker localization from omnidirectional videos
Author :
Reuse, Pascal ; Gurban, Mihai ; Austvoll, Ivar ; Thiran, Jean-Philippe
Author_Institution :
Signal Process. Lab. 5, Ecole Polytech. Fed. de Lausanne, Lausanne, Switzerland
fYear :
2009
fDate :
24-28 Aug. 2009
Firstpage :
735
Lastpage :
739
Abstract :
The use of omnidirectional cameras for videoconferencing promises to simplify the hardware setup necessary for large groups of participants. We investigate the use of a multimodal speaker detection algorithm on audio-visual sequences captured with such a camera, in particular, an algorithm that uses the audio energy together with the optical flow. We analyze several types of optical flow methods to determine the one which is appropriate to the omnidirectional context.
Keywords :
audio-visual systems; cameras; image sequences; speaker recognition; teleconferencing; audio-visual sequence capture; multimodal speaker detection algorithm; multimodal speaker localization; omnidirectional camera; omnidirectional videoconferencing; optical flow method; Cameras; Computer vision; Image motion analysis; Integrated optics; Optical distortion; Optical filters; Optical imaging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2009 17th European
Conference_Location :
Glasgow
Print_ISBN :
978-161-7388-76-7
Type :
conf
Filename :
7077577
Link To Document :
بازگشت