• DocumentCode
    2027727
  • Title

    Robot learning simultaneously a task and how to interpret human instructions

  • Author

    Grizou, Jonathan ; Lopes, M. ; Oudeyer, Pierre-Yves

  • Author_Institution
    Flowers Team, INRIA / ENSTA-Paristech, Paris, France
  • fYear
    2013
  • fDate
    18-22 Aug. 2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    This paper presents an algorithm to bootstrap shared understanding in a human-robot interaction scenario where the user teaches a robot a new task using teaching instructions yet unknown to it. In such cases, the robot needs to estimate simultaneously what the task is and the associated meaning of instructions received from the user. For this work, we consider a scenario where a human teacher uses initially unknown spoken words, whose associated unknown meaning is either a feedback (good/bad) or a guidance (go left, right, ...). We present computational results, within an inverse reinforcement learning framework, showing that a) it is possible to learn the meaning of unknown and noisy teaching instructions, as well as a new task at the same time, b) it is possible to reuse the acquired knowledge about instructions for learning new tasks, and c) even if the robot initially knows some of the instructions´ meanings, the use of extra unknown teaching instructions improves learning efficiency.
  • Keywords
    human-robot interaction; learning (artificial intelligence); bootstrap shared understanding; human instructions; human teacher; human-robot interaction; inverse reinforcement learning framework; noisy teaching instructions; robot simultaneous task learning; unknown meaning; unknown spoken words; Approximation algorithms; Computational modeling; Conferences; Education; Mathematical model; Robots; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Development and Learning and Epigenetic Robotics (ICDL), 2013 IEEE Third Joint International Conference on
  • Conference_Location
    Osaka
  • Type

    conf

  • DOI
    10.1109/DevLrn.2013.6652523
  • Filename
    6652523