DocumentCode :
2095876
Title :
Speaker localization among multi-faces in noisy environment by audio-visual integration
Author :
Kim, Hyun-Don ; Choi, Jong-Suk ; Kim, Munsang
Author_Institution :
Intelligent Robotics Res. Center, Korea Inst. of Sci. & Technol., Seoul
fYear :
2006
fDate :
15-19 May 2006
Firstpage :
1305
Lastpage :
1310
Abstract :
In this paper, we not only developed a reliable sound localization system including VAD (voice activity detection) component using three microphones but also a face tracking system using a vision camera. Moreover, we proposed a way to integrate these systems in the human-robot interaction to compensate the errors in the localization of a speaker and to reject unnecessary speech or noise signals entering from the undesired directions effectively. For the purpose of verifying our system´s performances, we installed the proposed audition and vision system to the prototype robot, called IROBAA (Intelligent ROBot for Active Audition), and showed how to integrate an audio-visual system
Keywords :
audio signal processing; error compensation; face recognition; intelligent robots; man-machine systems; robot vision; signal denoising; speaker recognition; IROBAA prototype robot; active audition; error compensation; face tracking system; human-robot interaction; intelligent robot; noise signals rejection; sound localization system; speaker localization; vision camera; voice activity detection; Acoustic noise; Cameras; Face detection; Intelligent robots; Loudspeakers; Machine vision; Microphones; Prototypes; Speech enhancement; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation, 2006. ICRA 2006. Proceedings 2006 IEEE International Conference on
Conference_Location :
Orlando, FL
ISSN :
1050-4729
Print_ISBN :
0-7803-9505-0
Type :
conf
DOI :
10.1109/ROBOT.2006.1641889
Filename :
1641889
Link To Document :
بازگشت