DocumentCode :
3647367
Title :
Imitation learning with non-parametric regression
Author :
Maarten Vaandrager;Robert Babuška;Lucian Buşoniu;Gabriel A.D. Lopes
Author_Institution :
Plotprojects, 1078MN Amsterdam, the Netherlands
fYear :
2012
fDate :
5/1/2012 12:00:00 AM
Firstpage :
91
Lastpage :
96
Abstract :
Humans are very fast learners. Yet, we rarely learn a task completely from scratch. Instead, we usually start with a rough approximation of the desired behavior and take the learning from there. In this paper, we use imitation to quickly generate a rough solution to a robotic task from demonstrations, supplied as a collection of state-space trajectories. Appropriate control actions needed to steer the system along the trajectories are then automatically learned in the form of a (nonlinear) state-feedback control law. The learning scheme has two components: a dynamic reference model and an adaptive inverse process model, both based on a data-driven, non-parametric method called local linear regression. The reference model infers the desired behavior from the demonstration trajectories, while the inverse process model provides the control actions to achieve this behavior and is improved online using learning. Experimental results with a pendulum swing-up problem and a robotic arm demonstrate the practical usefulness of this approach. The resulting learned dynamics are not limited to single trajectories, but capture instead the overall dynamics of the motion, making the proposed approach a promising step towards versatile learning machines such as future household robots, or robots for autonomous missions.
Keywords :
"Robots","Adaptation models","Approximation methods","Trajectory","Hidden Markov models","Memory management","Mathematical model"
Publisher :
ieee
Conference_Titel :
Automation Quality and Testing Robotics (AQTR), 2012 IEEE International Conference on
Print_ISBN :
978-1-4673-0701-7
Type :
conf
DOI :
10.1109/AQTR.2012.6237681
Filename :
6237681
Link To Document :
بازگشت