DocumentCode
2120445
Title
A Heuristic Reinforcement Learning Based on State Backtracking Method
Author
Min Fang ; Hao Li ; Xiaosong Zhang
Author_Institution
Inst. of Comput. Sci., Xidian Univ., Xi´an, China
Volume
1
fYear
2012
fDate
4-7 Dec. 2012
Firstpage
673
Lastpage
678
Abstract
Since learning action selection strategy is time-consuming due to the reinforcement learning algorithm, a heuristic reinforcement learning algorithm is presented based on the state backtracking reinforcement learning to improve the action selection strategy of the reinforcement learning. The selection strategies of repeated the action are analyzed and compared by state backtracking. A cost function is defined to denote the importance of repetitive actions. A novel heuristic function is given by combing the action-reward with the cost of an action. This algorithm reinforces the important of an action by heuristic function to speed learning and reduces unnecessary explorations by the cost function, so as to steadily improve the learning efficiency. The simulation results of two robot games proves that the algorithm can effectively enhancement the learning rate of Q-learning based on the state backtracking heuristic reinforcement learning method.
Keywords
backtracking; computer games; learning (artificial intelligence); mobile robots; Q-learning; action selection strategy; action-reward; cost function; learning efficiency; repetitive actions; robot games; speed learning; state backtracking heuristic reinforcement learning method; Heuristic function; Q-Learning; Reinforcement learning; State backtracking;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location
Macau
Print_ISBN
978-1-4673-6057-9
Type
conf
DOI
10.1109/WI-IAT.2012.187
Filename
6511961
Link To Document