• DocumentCode
    2613469
  • Title

    A natural language instruction system for humanoid robots integrating situated speech recognition, visual recognition and on-line whole-body motion generation

  • Author

    Neo, Ee Sian ; Sakaguchi, Takeshi ; YOKOI, Kazuhito

  • Author_Institution
    Intell. Syst. Res. Inst., Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba
  • fYear
    2008
  • fDate
    2-5 July 2008
  • Firstpage
    1176
  • Lastpage
    1182
  • Abstract
    This paper presents an integrated on-line operation system that enables a human user to operate humanoid robots by using natural language instructions. This paper has two major contributions. First, we present an integrated behavior system that is able to trigger behaviors according to speech commands, by recognizing objects, triggering actions and generating whole body motions on-line. Second, we present a situated natural language instruction system that is able not only to act according to speech commands, but also response to the direction of the sound source. A system that is able to understand natural language instructions and act accordingly will need the integration of knowledge representation, perception, decision making and on-line motion generation technologies. This paper tackles this integration problem by addressing the issues of representing knowledge of objects and actions which facilitates natural language instructions for tasks in indoor human environments. We propose a taxonomy of objects in indoor human environments and a lexicon of actions in this preliminary attempt to construct a reliable and flexible natural language instruction system. We report on the implementation of the proposed system on humanoid robot HRP-2, which is able to locate auditory sources and receive natural language instructions from a user within 2 meters using a 8-channel microphone array connected to a speech recognition embedded system on-board the robot.
  • Keywords
    humanoid robots; knowledge representation; microphone arrays; motion control; natural language processing; object recognition; speech recognition; 8-channel microphone array; HRP-2 robot; action triggering; auditory source location; decision making; humanoid robots; indoor human environment; integrated behavior system; knowledge representation; natural language instruction system; object recognition; online whole-body motion generation; perception; sound source direction; speech commands; speech recognition; visual recognition; Educational robots; Humanoid robots; Humans; Intelligent robots; Microphone arrays; Mobile robots; Natural languages; Robotic assembly; Service robots; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Intelligent Mechatronics, 2008. AIM 2008. IEEE/ASME International Conference on
  • Conference_Location
    Xian
  • Print_ISBN
    978-1-4244-2494-8
  • Electronic_ISBN
    978-1-4244-2495-5
  • Type

    conf

  • DOI
    10.1109/AIM.2008.4601829
  • Filename
    4601829