DocumentCode
518709
Title
A hierarchical reinforcement learning algorithm based on heuristic reward function
Author
Yan, Qicui ; Liu, Quan ; Hu, Daojing
Author_Institution
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
Volume
3
fYear
2010
fDate
27-29 March 2010
Firstpage
371
Lastpage
376
Abstract
A hierarchical reinforcement learning method based on heuristic reward function is proposed to solve the problem of “curse of dimensionality”, that is the states space will grow exponentially in the number of features, and low convergence speed. The method can reduce state spaces greatly and can enhance the speed of the study. Choose actions with favorable purpose and efficiency so as to optimize reward function and enhance convergence speed. Apply this method to the Tetris game; the experiment result shows that the method can partly solve the “curse of dimensionality” and can enhance the convergence speed prominent.
Keywords
computer games; convergence; learning (artificial intelligence); optimisation; Tetris game; convergence speed; dimensionality curse; heuristic reward function; hierarchical reinforcement learning algorithm; reward function optimization; Computer science; Control theory; Convergence; Function approximation; Heuristic algorithms; Learning systems; Machine learning; Space technology; State-space methods; Statistics; Tetris; curse of dimensionality; heuristic reward function; hierarchical reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Computer Control (ICACC), 2010 2nd International Conference on
Conference_Location
Shenyang
Print_ISBN
978-1-4244-5845-5
Type
conf
DOI
10.1109/ICACC.2010.5486837
Filename
5486837
Link To Document