DocumentCode
8447
Title
Reinforcement learning transfer based on subgoal discovery and subtask similarity
Author
Hao Wang ; Shunguo Fan ; Jinhua Song ; Yang Gao ; Xingguo Chen
Author_Institution
Dept. of Comput. Sci. & Technol., Nanjing Univ., Nanjing, China
Volume
1
Issue
3
fYear
2014
fDate
Jul-14
Firstpage
257
Lastpage
266
Abstract
This paper studies the problem of transfer learning in the context of reinforcement learning. We propose a novel transfer learning method that can speed up reinforcement learning with the aid of previously learnt tasks. Before performing extensive learning episodes, our method attempts to analyze the learning task via some exploration in the environment, and then attempts to reuse previous learning experience whenever it is possible and appropriate. In particular, our proposed method consists of four stages: 1) subgoal discovery, 2) option construction, 3) similarity searching, and 4) option reusing. Especially, in order to fulfill the task of identifying similar options, we propose a novel similarity measure between options, which is built upon the intuition that similar options have similar state-action probabilities. We examine our algorithm using extensive experiments, comparing it with existing methods. The results show that our method outperforms conventional non-transfer reinforcement learning algorithms, as well as existing transfer learning methods, by a wide margin.
Keywords
learning (artificial intelligence); probability; learning episodes; learning experience; option construction stage; option reusing stage; reinforcement learning transfer; similarity measure; similarity searching stage; state-action probability; subgoal discovery; subgoal discovery stage; subtask similarity; transfer learning method; Approximation algorithms; Games; Learning (artificial intelligence); Learning systems; Libraries; Trajectory; Reinforcement learning; options; source task library; subtask; transfer learning;
fLanguage
English
Journal_Title
Automatica Sinica, IEEE/CAA Journal of
Publisher
ieee
ISSN
2329-9266
Type
jour
DOI
10.1109/JAS.2014.7004683
Filename
7004683
Link To Document