• DocumentCode
    3642771
  • Title

    Approximate reinforcement learning: An overview

  • Author

    Lucian Buşoniu;Damien Ernst;Bart De Schutter;Robert Babuška

  • Author_Institution
    Delft Center for Systems &
  • fYear
    2011
  • fDate
    4/1/2011 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Reinforcement learning (RL) allows agents to learn how to optimally interact with complex environments. Fueled by recent advances in approximation-based algorithms, RL has obtained impressive successes in robotics, artificial intelligence, control, operations research, etc. However, the scarcity of survey papers about approximate RL makes it difficult for newcomers to grasp this intricate field. With the present overview, we take a step toward alleviating this situation. We review methods for approximate RL, starting from their dynamic programming roots and organizing them into three major classes: approximate value iteration, policy iteration, and policy search. Each class is subdivided into representative categories, highlighting among others offline and online algorithms, policy gradient methods, and simulation-based techniques. We also compare the different categories of methods, and outline possible ways to enhance the reviewed algorithms.
  • Keywords
    "Approximation algorithms","Equations","Function approximation","Trajectory","Markov processes","Mathematical model"
  • Publisher
    ieee
  • Conference_Titel
    Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
  • Print_ISBN
    978-1-4244-9887-1
  • Type

    conf

  • DOI
    10.1109/ADPRL.2011.5967353
  • Filename
    5967353