Title :
Probabilistic Speaker Localization in Noisy Environments by Audio-Visual Integration
Author :
Choi, Jong-Suk ; Kim, Munsang ; Kim, Hyun-Don
Author_Institution :
Intelligent Robotics Res. Center, Korea Inst. of Sci. & Technol., Seoul
Abstract :
In this paper, we have developed not only a probabilistic sound localization system including VAD (voice activity detection) component using three microphones but also a face tracking system using a vision camera. Moreover, we have proposed a way to integrate these systems to compensate the errors in the localization of a speaker and to reject unnecessary speech or noise signals entering from the undesired directions effectively. For the purpose of verifying our system´s performances, we have installed the proposed audition and vision system to the prototype robot, called IROBAA (intelligent robot for active audition), and showed how to integrate an audio-visual system
Keywords :
audio signal processing; control engineering computing; face recognition; image sensors; intelligent robots; man-machine systems; speaker recognition; IROBAA; audio-visual integration; face tracking system; intelligent robot; noisy environments; probabilistic sound localization system; probabilistic speaker localization; vision camera; voice activity detection; Acoustic noise; Cameras; Face detection; Intelligent robots; Loudspeakers; Machine vision; Microphones; Prototypes; Speech enhancement; Working environment noise; audio-visual integration; face tracking; human robot interaction; sound localization; voice activity detection;
Conference_Titel :
Intelligent Robots and Systems, 2006 IEEE/RSJ International Conference on
Conference_Location :
Beijing
Print_ISBN :
1-4244-0258-1
Electronic_ISBN :
1-4244-0259-X
DOI :
10.1109/IROS.2006.282260