• DocumentCode
    2497909
  • Title

    Evolutionary value function approximation

  • Author

    Davarynejad, Mohsen ; Van Ast, Jelmer ; Vrancken, Jos ; Van den Berg, Jan

  • Author_Institution
    Fac. of Technol., Policy & Manage., Delft Univ. of Technol., Delft, Netherlands
  • fYear
    2011
  • fDate
    11-15 April 2011
  • Firstpage
    151
  • Lastpage
    155
  • Abstract
    The standard reinforcement learning algorithms have proven to be effective tools for letting an agent learn from its experiences generated by its interaction with an environment. In this paper an evolutionary approach is proposed to accelerate learning speed in tabular reinforcement learning algorithms. In the proposed approach, in order to accelerate the learning speed of agents, the state-value is not only approximated, but through using the concept of evolutionary algorithms, they are evolved, with extra bonus of giving each agent the opportunity to exchange its knowledge. The proposed evolutionary value function approximation, helps in moving from a single isolated learning stage to cooperative exploration of the search space and accelerating learning speed. The performance of the proposed algorithm is compared with the standard SARSA algorithm and some of its properties are discussed. The experimental analysis confirms that the proposed approach has higher convergent speed with a negligible increase in computational complexity.
  • Keywords
    computational complexity; evolutionary computation; function approximation; learning (artificial intelligence); SARSA algorithm; agent learning speed; computational complexity; evolutionary algorithms; evolutionary value function approximation; tabular reinforcement learning algorithms; Acceleration; Approximation methods; Optimization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
  • Conference_Location
    Paris
  • Print_ISBN
    978-1-4244-9887-1
  • Type

    conf

  • DOI
    10.1109/ADPRL.2011.5967349
  • Filename
    5967349