• DocumentCode
    518709
  • Title

    A hierarchical reinforcement learning algorithm based on heuristic reward function

  • Author

    Yan, Qicui ; Liu, Quan ; Hu, Daojing

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
  • Volume
    3
  • fYear
    2010
  • fDate
    27-29 March 2010
  • Firstpage
    371
  • Lastpage
    376
  • Abstract
    A hierarchical reinforcement learning method based on heuristic reward function is proposed to solve the problem of “curse of dimensionality”, that is the states space will grow exponentially in the number of features, and low convergence speed. The method can reduce state spaces greatly and can enhance the speed of the study. Choose actions with favorable purpose and efficiency so as to optimize reward function and enhance convergence speed. Apply this method to the Tetris game; the experiment result shows that the method can partly solve the “curse of dimensionality” and can enhance the convergence speed prominent.
  • Keywords
    computer games; convergence; learning (artificial intelligence); optimisation; Tetris game; convergence speed; dimensionality curse; heuristic reward function; hierarchical reinforcement learning algorithm; reward function optimization; Computer science; Control theory; Convergence; Function approximation; Heuristic algorithms; Learning systems; Machine learning; Space technology; State-space methods; Statistics; Tetris; curse of dimensionality; heuristic reward function; hierarchical reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computer Control (ICACC), 2010 2nd International Conference on
  • Conference_Location
    Shenyang
  • Print_ISBN
    978-1-4244-5845-5
  • Type

    conf

  • DOI
    10.1109/ICACC.2010.5486837
  • Filename
    5486837