• DocumentCode
    8447
  • Title

    Reinforcement learning transfer based on subgoal discovery and subtask similarity

  • Author

    Hao Wang ; Shunguo Fan ; Jinhua Song ; Yang Gao ; Xingguo Chen

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Nanjing Univ., Nanjing, China
  • Volume
    1
  • Issue
    3
  • fYear
    2014
  • fDate
    Jul-14
  • Firstpage
    257
  • Lastpage
    266
  • Abstract
    This paper studies the problem of transfer learning in the context of reinforcement learning. We propose a novel transfer learning method that can speed up reinforcement learning with the aid of previously learnt tasks. Before performing extensive learning episodes, our method attempts to analyze the learning task via some exploration in the environment, and then attempts to reuse previous learning experience whenever it is possible and appropriate. In particular, our proposed method consists of four stages: 1) subgoal discovery, 2) option construction, 3) similarity searching, and 4) option reusing. Especially, in order to fulfill the task of identifying similar options, we propose a novel similarity measure between options, which is built upon the intuition that similar options have similar state-action probabilities. We examine our algorithm using extensive experiments, comparing it with existing methods. The results show that our method outperforms conventional non-transfer reinforcement learning algorithms, as well as existing transfer learning methods, by a wide margin.
  • Keywords
    learning (artificial intelligence); probability; learning episodes; learning experience; option construction stage; option reusing stage; reinforcement learning transfer; similarity measure; similarity searching stage; state-action probability; subgoal discovery; subgoal discovery stage; subtask similarity; transfer learning method; Approximation algorithms; Games; Learning (artificial intelligence); Learning systems; Libraries; Trajectory; Reinforcement learning; options; source task library; subtask; transfer learning;
  • fLanguage
    English
  • Journal_Title
    Automatica Sinica, IEEE/CAA Journal of
  • Publisher
    ieee
  • ISSN
    2329-9266
  • Type

    jour

  • DOI
    10.1109/JAS.2014.7004683
  • Filename
    7004683