• DocumentCode
    3647367
  • Title

    Imitation learning with non-parametric regression

  • Author

    Maarten Vaandrager;Robert Babuška;Lucian Buşoniu;Gabriel A.D. Lopes

  • Author_Institution
    Plotprojects, 1078MN Amsterdam, the Netherlands
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    91
  • Lastpage
    96
  • Abstract
    Humans are very fast learners. Yet, we rarely learn a task completely from scratch. Instead, we usually start with a rough approximation of the desired behavior and take the learning from there. In this paper, we use imitation to quickly generate a rough solution to a robotic task from demonstrations, supplied as a collection of state-space trajectories. Appropriate control actions needed to steer the system along the trajectories are then automatically learned in the form of a (nonlinear) state-feedback control law. The learning scheme has two components: a dynamic reference model and an adaptive inverse process model, both based on a data-driven, non-parametric method called local linear regression. The reference model infers the desired behavior from the demonstration trajectories, while the inverse process model provides the control actions to achieve this behavior and is improved online using learning. Experimental results with a pendulum swing-up problem and a robotic arm demonstrate the practical usefulness of this approach. The resulting learned dynamics are not limited to single trajectories, but capture instead the overall dynamics of the motion, making the proposed approach a promising step towards versatile learning machines such as future household robots, or robots for autonomous missions.
  • Keywords
    "Robots","Adaptation models","Approximation methods","Trajectory","Hidden Markov models","Memory management","Mathematical model"
  • Publisher
    ieee
  • Conference_Titel
    Automation Quality and Testing Robotics (AQTR), 2012 IEEE International Conference on
  • Print_ISBN
    978-1-4673-0701-7
  • Type

    conf

  • DOI
    10.1109/AQTR.2012.6237681
  • Filename
    6237681