DocumentCode :
663588
Title :
Skills transfer across dissimilar robots by learning context-dependent rewards
Author :
Malekzadeh, Milad S. ; Bruno, Danilo ; Calinon, Sylvain ; Nanayakkara, T. ; Caldwell, D.G.
Author_Institution :
Dept. of Adv. Robot., Ist. Italiano di Tecnol. (IIT), Genoa, Italy
fYear :
2013
fDate :
3-7 Nov. 2013
Firstpage :
1746
Lastpage :
1751
Abstract :
Robot programming by demonstration encompasses a wide range of learning strategies, from simple mimicking of the demonstrator´s actions to the higher level extraction of the underlying intent. By focusing on this last form, we study the problem of extracting the reward function explaining the demonstrations from a set of candidate reward functions, and using this information for self-refinement of the skill. This definition of the problem has links with inverse reinforcement learning problems in which the robot autonomously extracts an optimal reward function that defines the goal of the task. By relying on Gaussian mixture models, the proposed approach learns how the different candidate reward functions are combined, and in which contexts or phases of the task they are relevant for explaining the user´s demonstrations. The extracted reward profile is then exploited to improve the skill with a self-refinement approach based on expectation-maximization, allowing the imitator to reach a skill level that goes beyond the demonstrations. The approach can be used to reproduce a skill in different ways or to transfer tasks across robots of different structures. The proposed approach is tested in simulation with a new type of continuum robot (STIFF-FLOP), using kinesthetic demonstrations from a Barrett WAM manipulator.
Keywords :
Gaussian processes; expectation-maximisation algorithm; learning (artificial intelligence); manipulators; robot programming; Barrett WAM manipulator; Gaussian mixture model; STIFF-FLOP; context-dependent rewards; continuum robot; dissimilar robot; expectation-maximization; inverse reinforcement learning; kinesthetic demonstration; reward function; robot programming; self-refinement approach; skills transfer; Context; Kinematics; Learning (artificial intelligence); Manipulators; Trajectory; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on
Conference_Location :
Tokyo
ISSN :
2153-0858
Type :
conf
DOI :
10.1109/IROS.2013.6696585
Filename :
6696585
Link To Document :
بازگشت