DocumentCode :
2492793
Title :
Associating spoken commands with multiple human users in a dynamic environment
Author :
Thompson, Simon ; Sasaki, Yoko ; Kagami, Satoshi
Author_Institution :
Digital Human Res. Center, Nat. Inst. of Adv. Ind. Sci. & Technol., Tokyo
fYear :
2008
fDate :
15-18 Dec. 2008
Firstpage :
207
Lastpage :
212
Abstract :
This paper presents a method for combining human tracking information with recognised sound commands extracted from localised and separated sound sources, enabling spoken commands to be placed in the context of an inhabited, dynamic environment. Humans are tracked using a laser range finder based system which extracts and tracks leg profiles from the raw range data. A 32 channel microphone array localises (angular) and separates different sound sources in the environment and feeds segmented sounds into a speech recognition system. Human tracking position estimates, sound source localisation results, and recognised commands are associated by probabilistic matching over time of the angle of observation from both the command sound source and the human position estimates. Experiments are conducted verifying the command recognition system and the ability to associate commands with humans in the case of a single human giving commands from multiple locations, and also multiple humans giving commands from multiple locations.
Keywords :
feature extraction; human-robot interaction; laser ranging; microphone arrays; mobile robots; pattern matching; probability; service robots; speech recognition; tracking; traffic engineering computing; associating spoken command; channel microphone array; dynamic environment; human tracking information system; human tracking position estimation; laser range finder based system; mobile robot; probabilistic matching; service robot; sound command extraction; sound source localisation; speech recognition system; tracked human pedestrian; Data mining; Humans; Loudspeakers; Mechanical engineering; Microphone arrays; Mobile robots; Noise robustness; Service robots; Speech enhancement; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Sensors, Sensor Networks and Information Processing, 2008. ISSNIP 2008. International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-3822-8
Electronic_ISBN :
978-1-4244-2957-8
Type :
conf
DOI :
10.1109/ISSNIP.2008.4761988
Filename :
4761988
Link To Document :
بازگشت