مرکز منطقه ای اطلاع رساني علوم و فناوري - Shaping in reinforcement learning via knowledge transferred from human-demonstrations

DocumentCode :

2250315

Title :

Shaping in reinforcement learning via knowledge transferred from human-demonstrations

Author :

Guofang, Wang ; Zhou, Fang ; Ping, Li ; Bo, Li

Author_Institution :

School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, P.R. China

fYear :

2015

fDate :

28-30 July 2015

Firstpage :

3033

Lastpage :

3038

Abstract :

Transfer has been widely used to ameliorate the slow convergence speed of reinforcement learning (RL) by reusing the previous obtained knowledge from other related but distinct tasks. In this paper, we propose a framework to transfer knowledge learned directly from human-demonstration trajectories of source tasks to shape the RL algorithm in target task, so as to avoid the time-consuming training process of RL in source tasks and thus we expand the learning paradigm of transfer in RL domains. In our framework, rather than transferring the most common value function or policy, we adopt the visit frequencies of states in successful demonstration trajectories as the acquired knowledge, and then perform transfer via shared agent space. Simulation experiments in obstacle avoidance problems suggest that the transferred knowledge could accelerate the learning process in target task obviously. And as a case study, the experiments show the potential of our framework in knowledge transfer in RL tasks.

Keywords :

Birds; Electron tubes; Games; Learning (artificial intelligence); Navigation; Shape; Trajectory; Human-demonstrations; Reinforcement learning; transfer;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Control Conference (CCC), 2015 34th Chinese

Conference_Location :

Hangzhou, China

Type :

conf

DOI :

10.1109/ChiCC.2015.7260106

Filename :

7260106

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2250315