• DocumentCode
    575482
  • Title

    Adaptive model learning method for reinforcement learning

  • Author

    Hwang, Kao-Shing ; Jiang, Wei-Cheng ; Chen, Yu-Jen

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Sun Yat-Sen, Kaohsiung, Taiwan
  • fYear
    2012
  • fDate
    20-23 Aug. 2012
  • Firstpage
    1277
  • Lastpage
    1280
  • Abstract
    The original Q-learning method is difficult on achieving sample efficiency such as training a policy to get to a goal with in limited time step. So, the Dyna-Q agent is proposed to speed up the policy learning. However, the Dyna-Q did not specify how to build the model, so the table is used to be the model largely. In this paper, we proposed an adaptive model learning method based on tree structures and combined with Q-Learning to form Tree-Based Dyna-Q agent to enhance the policy learning. When the tree-based model learns an accurate model, a planning method can use the model to produce simulated experiences to accelerate value iterations. Thus, the agent with the proposed method can obtain virtual experiences for updating the policy. The simulation result shows that training time of our method can improve obviously.
  • Keywords
    iterative methods; learning (artificial intelligence); trees (mathematics); adaptive model learning method; original Q-learning method; planning method; policy learning; reinforcement learning; sample efficiency; tree-based Dyna-Q agent; tree-based model; value iterations; Adaptation models; Educational institutions; Learning; Learning systems; Planning; Silicon; Training; Dyna-Q agent; Reinforcement learning; adaptive model learning method;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    SICE Annual Conference (SICE), 2012 Proceedings of
  • Conference_Location
    Akita
  • ISSN
    pending
  • Print_ISBN
    978-1-4673-2259-1
  • Type

    conf

  • Filename
    6318643