DocumentCode :
2524065
Title :
Multimodal talker localization in video conferencing environments
Author :
Lo, David ; Goubran, Rafik A. ; Dansereau, Richard M.
Author_Institution :
Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, Ont., Canada
fYear :
2004
fDate :
2-3 Oct. 2004
Firstpage :
195
Lastpage :
200
Abstract :
This study proposes a new modular architecture for multimodal talker localization in video conferencing environments. Data streams from the sensing devices are decoupled at the outset, and a purpose-specific localization method locates the talker separately for each stream. The individual localization results are then combined using data fusion techniques to form the final estimate of the talker's location. The proposed architecture has the advantage of being flexible: an additional localization modality can be added by simply duplicating the functional module in the architecture with a new sensor and its associated localizer. The architecture is tested with three localization modalities: one audio localizer and two different video localizers. The results demonstrate that the modular architecture yields a multimodal localization method that outperforms the audio and video localizers when each is used as a stand-alone, single-modal localization method.
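The modular structure described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the names `Estimate`, `fuse`, and `locate_talker` are hypothetical, the talker location is reduced to a single azimuth angle, and the fusion rule is a plain inverse-variance weighted average, since the abstract does not say which data fusion technique the authors used. The sketch also ignores angle wraparound.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Estimate:
    azimuth_deg: float  # estimated talker direction (hypothetical 1-D location)
    variance: float     # assumed self-reported uncertainty, in degrees squared


# One functional module = one sensor stream plus its own localizer.
# The localizer signature is an assumption, not the paper's interface.
Localizer = Callable[[object], Estimate]


def fuse(estimates: List[Estimate]) -> Estimate:
    """Combine per-modality estimates by inverse-variance weighting.

    This fusion rule is swapped in for illustration only; the source
    does not specify the fusion technique.
    """
    weights = [1.0 / e.variance for e in estimates]
    total = sum(weights)
    azimuth = sum(w * e.azimuth_deg for w, e in zip(weights, estimates)) / total
    return Estimate(azimuth, 1.0 / total)


def locate_talker(modules: Dict[str, Localizer],
                  frames: Dict[str, object]) -> Estimate:
    """Run every localizer on its own decoupled stream, then fuse the results."""
    estimates = [localizer(frames[name]) for name, localizer in modules.items()]
    return fuse(estimates)


# Stub localizers standing in for one audio and two video modalities.
modules: Dict[str, Localizer] = {
    "audio": lambda frame: Estimate(30.0, 4.0),
    "video_a": lambda frame: Estimate(34.0, 1.0),
    "video_b": lambda frame: Estimate(32.0, 4.0),
}
frames = {name: None for name in modules}  # placeholder sensor data

fused = locate_talker(modules, frames)
print(fused.azimuth_deg, fused.variance)
```

Under these assumptions, adding a modality amounts to registering one more entry in `modules`, which mirrors the flexibility claimed in the abstract; the fusion step itself does not change.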
Keywords :
audio signal processing; sensor fusion; teleconferencing; video signal processing; audio localizers; data fusion; data streams; functional module; localization modality; modular architecture; multimodal localization; multimodal talker localization; sensing devices; single modal localization; specific localization methods; video conferencing environments; video localizers; Cameras; Computer architecture; Microphone arrays; Sensor arrays; Sensor fusion; Streaming media; Systems engineering and theory; Telecommunication traffic; Testing; Videoconference;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Haptic, Audio and Visual Environments and Their Applications, 2004. HAVE 2004. Proceedings. The 3rd IEEE International Workshop on
Print_ISBN :
0-7803-8817-8
Type :
conf
DOI :
10.1109/HAVE.2004.1391905
Filename :
1391905