مرکز منطقه ای اطلاع رساني علوم و فناوري - Skills transfer across dissimilar robots by learning context-dependent rewards

DocumentCode :

663588

Title :

Skills transfer across dissimilar robots by learning context-dependent rewards

Author :

Malekzadeh, Milad S. ; Bruno, Danilo ; Calinon, Sylvain ; Nanayakkara, T. ; Caldwell, D.G.

Author_Institution :

Dept. of Adv. Robot., Ist. Italiano di Tecnol. (IIT), Genoa, Italy

fYear :

2013

fDate :

3-7 Nov. 2013

Firstpage :

1746

Lastpage :

1751

Abstract :

Robot programming by demonstration encompasses a wide range of learning strategies, from simple mimicking of the demonstrator´s actions to the higher level extraction of the underlying intent. By focusing on this last form, we study the problem of extracting the reward function explaining the demonstrations from a set of candidate reward functions, and using this information for self-refinement of the skill. This definition of the problem has links with inverse reinforcement learning problems in which the robot autonomously extracts an optimal reward function that defines the goal of the task. By relying on Gaussian mixture models, the proposed approach learns how the different candidate reward functions are combined, and in which contexts or phases of the task they are relevant for explaining the user´s demonstrations. The extracted reward profile is then exploited to improve the skill with a self-refinement approach based on expectation-maximization, allowing the imitator to reach a skill level that goes beyond the demonstrations. The approach can be used to reproduce a skill in different ways or to transfer tasks across robots of different structures. The proposed approach is tested in simulation with a new type of continuum robot (STIFF-FLOP), using kinesthetic demonstrations from a Barrett WAM manipulator.

Keywords :

Gaussian processes; expectation-maximisation algorithm; learning (artificial intelligence); manipulators; robot programming; Barrett WAM manipulator; Gaussian mixture model; STIFF-FLOP; context-dependent rewards; continuum robot; dissimilar robot; expectation-maximization; inverse reinforcement learning; kinesthetic demonstration; reward function; robot programming; self-refinement approach; skills transfer; Context; Kinematics; Learning (artificial intelligence); Manipulators; Trajectory; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on

Conference_Location :

Tokyo

ISSN :

2153-0858

Type :

conf

DOI :

10.1109/IROS.2013.6696585

Filename :

6696585

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=663588