DocumentCode :
3518913
Title :
Model-based imitation learning by probabilistic trajectory matching
Author :
Englert, Peter ; Paraschos, Alexandros ; Peters, Jochen ; Deisenroth, Marc Peter
Author_Institution :
Dept. of Comput. Sci., Tech. Univ. Darmstadt, Darmstadt, Germany
fYear :
2013
fDate :
6-10 May 2013
Firstpage :
1922
Lastpage :
1927
Abstract :
One of the most elegant ways of teaching new skills to robots is to provide demonstrations of a task and let the robot imitate this behavior. Such imitation learning is a non-trivial task: Different anatomies of robot and teacher, and reduced robustness towards changes in the control task are two major difficulties in imitation learning. We present an imitation-learning approach to efficiently learn a task from expert demonstrations. Instead of finding policies indirectly, either via state-action mappings (behavioral cloning), or cost function learning (inverse reinforcement learning), our goal is to find policies directly such that predicted trajectories match observed ones. To achieve this aim, we model the trajectory of the teacher and the predicted robot trajectory by means of probability distributions. We match these distributions by minimizing their Kullback-Leibler divergence. In this paper, we propose to learn probabilistic forward models to compute a probability distribution over trajectories. We compare our approach to model-based reinforcement learning methods with hand-crafted cost functions. Finally, we evaluate our method with experiments on a real compliant robot.
Keywords :
learning (artificial intelligence); robot programming; statistical distributions; trajectory control; Kullback-Leibler divergence; behavioral cloning; cost function learning; inverse reinforcement learning; model-based imitation learning; probabilistic forward model; probabilistic trajectory matching; probability distribution; robot trajectory; state-action mappings; Cost function; Joints; Predictive models; Probabilistic logic; Probability distribution; Robots; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation (ICRA), 2013 IEEE International Conference on
Conference_Location :
Karlsruhe
ISSN :
1050-4729
Print_ISBN :
978-1-4673-5641-1
Type :
conf
DOI :
10.1109/ICRA.2013.6630832
Filename :
6630832
Link To Document :
بازگشت