DocumentCode
2492793
Title
Associating spoken commands with multiple human users in a dynamic environment
Author
Thompson, Simon ; Sasaki, Yoko ; Kagami, Satoshi
Author_Institution
Digital Human Res. Center, Nat. Inst. of Adv. Ind. Sci. & Technol., Tokyo
fYear
2008
fDate
15-18 Dec. 2008
Firstpage
207
Lastpage
212
Abstract
This paper presents a method for combining human tracking information with recognised sound commands extracted from localised and separated sound sources, enabling spoken commands to be placed in the context of an inhabited, dynamic environment. Humans are tracked using a laser-range-finder-based system that extracts and tracks leg profiles from the raw range data. A 32-channel microphone array localises sound sources by angle, separates them, and feeds the segmented sounds into a speech recognition system. Human position estimates, sound source localisation results, and recognised commands are associated by probabilistically matching, over time, the angle of observation of the command sound source against the angles of the human position estimates. Experiments verify the command recognition system and the ability to associate commands with humans, both for a single human giving commands from multiple locations and for multiple humans giving commands from multiple locations.
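The angle-based association described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the probability model, so the Gaussian angular error model, the `sigma` value, the uniform prior over humans, and all function and parameter names below (`wrap_angle`, `bearing`, `associate_command`, `sensor_xy`) are assumptions chosen only for illustration.

```python
import math


def wrap_angle(a):
    """Wrap an angle in radians to the interval [-pi, pi)."""
    return (a + math.pi) % (2.0 * math.pi) - math.pi


def bearing(sensor_xy, target_xy):
    """Bearing (radians) of a target as seen from the sensor position."""
    return math.atan2(target_xy[1] - sensor_xy[1], target_xy[0] - sensor_xy[0])


def associate_command(sound_bearings, human_tracks,
                      sensor_xy=(0.0, 0.0), sigma=math.radians(10.0)):
    """Associate one command's sound source with one tracked human.

    sound_bearings: bearing of the sound source (radians), one per time step.
    human_tracks:   non-empty dict mapping a human id to a list of (x, y)
                    position estimates, aligned with the same time steps.
    sigma:          ASSUMED standard deviation of the angular error.

    Returns (best_id, posterior), where posterior maps each human id to a
    normalised probability under a uniform prior (also an assumption).
    """
    log_like = {}
    for hid, track in human_tracks.items():
        total = 0.0
        for theta, pos in zip(sound_bearings, track):
            # Angular mismatch between the sound source and this human.
            err = wrap_angle(theta - bearing(sensor_xy, pos))
            # Gaussian log-likelihood, accumulated over time steps.
            total += -0.5 * (err / sigma) ** 2
        log_like[hid] = total

    # Normalise in a numerically stable way (subtract the maximum first).
    peak = max(log_like.values())
    weights = {h: math.exp(v - peak) for h, v in log_like.items()}
    z = sum(weights.values())
    posterior = {h: w / z for h, w in weights.items()}
    best_id = max(posterior, key=posterior.get)
    return best_id, posterior


if __name__ == "__main__":
    tracks = {
        "A": [(1.0, 0.0), (1.0, 0.05), (1.0, 0.1)],   # roughly at 0 rad
        "B": [(0.0, 1.0), (0.05, 1.0), (0.1, 1.0)],   # roughly at pi/2 rad
    }
    bearings = [0.02, 0.04, 0.08]  # sound source observed near 0 rad
    print(associate_command(bearings, tracks))
```

Accumulating the log-likelihood over several time steps, rather than matching a single frame, is what lets the sketch keep the association stable while people move; a real system would also need to handle missed detections and tracks of unequal length, which are omitted here.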
Keywords
feature extraction; human-robot interaction; laser ranging; microphone arrays; mobile robots; pattern matching; probability; service robots; speech recognition; tracking; traffic engineering computing; associating spoken command; channel microphone array; dynamic environment; human tracking information system; human tracking position estimation; laser range finder based system; mobile robot; probabilistic matching; service robot; sound command extraction; sound source localisation; speech recognition system; tracked human pedestrian; Data mining; Humans; Loudspeakers; Mechanical engineering; Microphone arrays; Mobile robots; Noise robustness; Service robots; Speech enhancement; Speech recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Intelligent Sensors, Sensor Networks and Information Processing, 2008. ISSNIP 2008. International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
978-1-4244-3822-8
Electronic_ISBN
978-1-4244-2957-8
Type
conf
DOI
10.1109/ISSNIP.2008.4761988
Filename
4761988