• DocumentCode
    2120445
  • Title

    A Heuristic Reinforcement Learning Based on State Backtracking Method

  • Author

    Min Fang ; Hao Li ; Xiaosong Zhang

  • Author_Institution
    Inst. of Comput. Sci., Xidian Univ., Xi´an, China
  • Volume
    1
  • fYear
    2012
  • fDate
    4-7 Dec. 2012
  • Firstpage
    673
  • Lastpage
    678
  • Abstract
    Since learning action selection strategy is time-consuming due to the reinforcement learning algorithm, a heuristic reinforcement learning algorithm is presented based on the state backtracking reinforcement learning to improve the action selection strategy of the reinforcement learning. The selection strategies of repeated the action are analyzed and compared by state backtracking. A cost function is defined to denote the importance of repetitive actions. A novel heuristic function is given by combing the action-reward with the cost of an action. This algorithm reinforces the important of an action by heuristic function to speed learning and reduces unnecessary explorations by the cost function, so as to steadily improve the learning efficiency. The simulation results of two robot games proves that the algorithm can effectively enhancement the learning rate of Q-learning based on the state backtracking heuristic reinforcement learning method.
  • Keywords
    backtracking; computer games; learning (artificial intelligence); mobile robots; Q-learning; action selection strategy; action-reward; cost function; learning efficiency; repetitive actions; robot games; speed learning; state backtracking heuristic reinforcement learning method; Heuristic function; Q-Learning; Reinforcement learning; State backtracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
  • Conference_Location
    Macau
  • Print_ISBN
    978-1-4673-6057-9
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2012.187
  • Filename
    6511961