DocumentCode :
2250315
Title :
Shaping in reinforcement learning via knowledge transferred from human-demonstrations
Author :
Guofang, Wang ; Zhou, Fang ; Ping, Li ; Bo, Li
Author_Institution :
School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, P.R. China
fYear :
2015
fDate :
28-30 July 2015
Firstpage :
3033
Lastpage :
3038
Abstract :
Transfer has been widely used to mitigate the slow convergence of reinforcement learning (RL) by reusing previously obtained knowledge from related but distinct tasks. In this paper, we propose a framework that transfers knowledge learned directly from human-demonstration trajectories in source tasks to shape the RL algorithm in the target task. This avoids the time-consuming RL training process in the source tasks and expands the learning paradigm of transfer in RL domains. Rather than transferring a value function or policy, which is the most common choice, our framework adopts the visit frequencies of states in successful demonstration trajectories as the acquired knowledge and performs transfer via a shared agent space. Simulation experiments on obstacle avoidance problems suggest that the transferred knowledge can noticeably accelerate learning in the target task. As a case study, the experiments show the potential of our framework for knowledge transfer in RL tasks.
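Illustrative sketch (not taken from the paper): the abstract does not say how the visit frequencies enter the learning update, so the code below assumes one plausible reading, in which the normalized visit frequency of each agent-space state in successful demonstrations serves as a potential function for potential-based reward shaping. The state labels, the function names visit_frequencies and shaped_reward, and the parameters gamma and scale are all hypothetical.

```python
from collections import Counter


def visit_frequencies(trajectories):
    """Normalized visit frequency of each state over all demonstration trajectories."""
    counts = Counter(state for traj in trajectories for state in traj)
    total = sum(counts.values())
    return {state: count / total for state, count in counts.items()}


def shaped_reward(reward, state, next_state, freq, gamma=0.9, scale=1.0):
    """Assumed shaping rule: r + scale * (gamma * phi(s') - phi(s)).

    phi is the demonstration visit frequency; states never seen in a
    demonstration get a potential of 0.
    """
    phi_next = freq.get(next_state, 0.0)
    phi_cur = freq.get(state, 0.0)
    return reward + scale * (gamma * phi_next - phi_cur)


# Hypothetical successful demonstrations, written as sequences of agent-space states.
demos = [
    ["far", "far", "near", "goal"],
    ["far", "near", "near", "goal"],
]
freq = visit_frequencies(demos)

# Moving from a state absent from the demonstrations into a frequently
# demonstrated one yields a positive shaping bonus.
bonus = shaped_reward(0.0, "unseen", "far", freq, gamma=1.0)
```

Because the frequencies are defined over agent-space states rather than task-specific ones, the same freq table could in principle be reused in a different target task that shares the agent's sensing, which is the kind of transfer the abstract describes.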
Keywords :
Birds; Electron tubes; Games; Learning (artificial intelligence); Navigation; Shape; Trajectory; Human-demonstrations; Reinforcement learning; transfer;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Control Conference (CCC), 2015 34th Chinese
Conference_Location :
Hangzhou, China
Type :
conf
DOI :
10.1109/ChiCC.2015.7260106
Filename :
7260106