Title :
A natural language instruction system for humanoid robots integrating situated speech recognition, visual recognition and on-line whole-body motion generation
Author :
Neo, Ee Sian ; Sakaguchi, Takeshi ; YOKOI, Kazuhito
Author_Institution :
Intell. Syst. Res. Inst., Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba
Abstract :
This paper presents an integrated on-line operation system that enables a human user to operate humanoid robots by using natural language instructions. This paper has two major contributions. First, we present an integrated behavior system that is able to trigger behaviors according to speech commands, by recognizing objects, triggering actions and generating whole body motions on-line. Second, we present a situated natural language instruction system that is able not only to act according to speech commands, but also response to the direction of the sound source. A system that is able to understand natural language instructions and act accordingly will need the integration of knowledge representation, perception, decision making and on-line motion generation technologies. This paper tackles this integration problem by addressing the issues of representing knowledge of objects and actions which facilitates natural language instructions for tasks in indoor human environments. We propose a taxonomy of objects in indoor human environments and a lexicon of actions in this preliminary attempt to construct a reliable and flexible natural language instruction system. We report on the implementation of the proposed system on humanoid robot HRP-2, which is able to locate auditory sources and receive natural language instructions from a user within 2 meters using a 8-channel microphone array connected to a speech recognition embedded system on-board the robot.
Keywords :
humanoid robots; knowledge representation; microphone arrays; motion control; natural language processing; object recognition; speech recognition; 8-channel microphone array; HRP-2 robot; action triggering; auditory source location; decision making; humanoid robots; indoor human environment; integrated behavior system; knowledge representation; natural language instruction system; object recognition; online whole-body motion generation; perception; sound source direction; speech commands; speech recognition; visual recognition; Educational robots; Humanoid robots; Humans; Intelligent robots; Microphone arrays; Mobile robots; Natural languages; Robotic assembly; Service robots; Speech recognition;
Conference_Titel :
Advanced Intelligent Mechatronics, 2008. AIM 2008. IEEE/ASME International Conference on
Conference_Location :
Xian
Print_ISBN :
978-1-4244-2494-8
Electronic_ISBN :
978-1-4244-2495-5
DOI :
10.1109/AIM.2008.4601829