• DocumentCode
    3090995
  • Title

    Learning nonparametric policies by imitation

  • Author

    Grimes, David B. ; Rao, Rajesh P N

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Washington, Seattle, WA
  • fYear
    2008
  • fDate
    22-26 Sept. 2008
  • Firstpage
    2022
  • Lastpage
    2028
  • Abstract
    A long cherished goal in artificial intelligence has been the ability to endow a robot with the capacity to learn and generalize skills from watching a human teacher. Such an ability to learn by imitation has remained hard to achieve due to a number of factors, including the problem of learning in high-dimensional spaces and the problem of uncertainty. In this paper, we propose a new probabilistic approach to the problem of teaching a high degree-of-freedom robot (in particular, a humanoid robot) flexible and generalizable skills via imitation of a human teacher. The robot uses inference in a graphical model to learn sensor-based dynamics and infer a stable plan from a teacherpsilas demonstration of an action. The novel contribution of this work is a method for learning a nonparametric policy which generalizes a fixed action plan to operate over a continuous space of task variation. A notable feature of the approach is that it does not require any knowledge of the physics of the robot or the environment. By leveraging advances in probabilistic inference and Gaussian process regression, the method produces a nonparametric policy for sensor-based feedback control in continuous state and action spaces. We present experimental and simulation results using a Fujitsu HOAP-2 humanoid robot demonstrating imitation-based learning of a task involving lifting objects of different weights from a single human demonstration.
  • Keywords
    Gaussian processes; feedback; graph theory; humanoid robots; regression analysis; sensors; Fujitsu HOAP-2 humanoid robot; Gaussian process regression; artificial intelligence; degree-of-freedom robot; graphical model; high-dimensional spaces; human teacher; nonparametric policies; probabilistic approach; probabilistic inference; sensor-based feedback control; History; Humanoid robots; Humans; Planning; Probabilistic logic; Robot sensing systems; Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on
  • Conference_Location
    Nice
  • Print_ISBN
    978-1-4244-2057-5
  • Type

    conf

  • DOI
    10.1109/IROS.2008.4650778
  • Filename
    4650778