A hierarchical reinforcement learning algorithm based on heuristic reward function

Author

Yan, Qicui ; Liu, Quan ; Hu, Daojing

Author_Institution

Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China

Volume

3

fYear

2010

fDate

27-29 March 2010

Firstpage

371

Lastpage

376

Abstract

A hierarchical reinforcement learning method based on heuristic reward function is proposed to solve the problem of “curse of dimensionality”, that is the states space will grow exponentially in the number of features, and low convergence speed. The method can reduce state spaces greatly and can enhance the speed of the study. Choose actions with favorable purpose and efficiency so as to optimize reward function and enhance convergence speed. Apply this method to the Tetris game; the experiment result shows that the method can partly solve the “curse of dimensionality” and can enhance the convergence speed prominent.

Keywords

computer games; convergence; learning (artificial intelligence); optimisation; Tetris game; convergence speed; dimensionality curse; heuristic reward function; hierarchical reinforcement learning algorithm; reward function optimization; Computer science; Control theory; Convergence; Function approximation; Heuristic algorithms; Learning systems; Machine learning; Space technology; State-space methods; Statistics; Tetris; curse of dimensionality; heuristic reward function; hierarchical reinforcement learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Advanced Computer Control (ICACC), 2010 2nd International Conference on

Conference_Location

Shenyang

Print_ISBN

978-1-4244-5845-5

Type

conf

DOI

10.1109/ICACC.2010.5486837

Filename

5486837