Title :
Speech-based localization of multiple persons for an interface robot
Author :
Klaassen, Gradje ; Zajdel, Wojciech ; Kröse, Ben J A
Author_Institution :
ISLA, Informatics Inst., Amsterdam Univ., Netherlands
Abstract :
Robots are conveniently controlled by a human operator with spoken commands, since voice is a natural communication medium for humans. In order to successfully carry out a command, a robot needs to know which of the possibly many people gave the command and where this person is located. In this paper, we present a particle-filter-based algorithm for localization of multiple speakers in an environment where only one person speaks at a time. The algorithm incorporates person-specific voice features (vowel formant frequencies) in order to distinguish between the speakers. The voice features are supported by azimuth angle measurements obtained by a pair of microphones. We test our approach using the microphone system of the Philips iCat interface robot.
Keywords :
intelligent robots; man-machine systems; multiuser detection; speaker recognition; speech intelligibility; speech-based user interfaces; Philips iCat; human operator; interface robot; microphone system; particle-filter; person-specific voice features; robot control; speech-based localization; spoken commands; vowel formant frequencies; Auditory system; Azimuth; Frequency; Goniometers; Humans; Intelligent robots; Linear predictive coding; Loudspeakers; Microphones; Position measurement; Bayesian filtering; Human-robot interaction; Multi-target tracking; Speaker localization;
Conference_Title :
Computational Intelligence in Robotics and Automation, 2005. CIRA 2005. Proceedings. 2005 IEEE International Symposium on
Print_ISBN :
0-7803-9355-4
DOI :
10.1109/CIRA.2005.1554253