Evolutionary value function approximation

Author

Davarynejad, Mohsen ; Van Ast, Jelmer ; Vrancken, Jos ; Van den Berg, Jan

Author_Institution

Fac. of Technol., Policy & Manage., Delft Univ. of Technol., Delft, Netherlands

fYear

2011

fDate

11-15 April 2011

Firstpage

151

Lastpage

155

Abstract

The standard reinforcement learning algorithms have proven to be effective tools for letting an agent learn from its experiences generated by its interaction with an environment. In this paper an evolutionary approach is proposed to accelerate learning speed in tabular reinforcement learning algorithms. In the proposed approach, in order to accelerate the learning speed of agents, the state-value is not only approximated, but through using the concept of evolutionary algorithms, they are evolved, with extra bonus of giving each agent the opportunity to exchange its knowledge. The proposed evolutionary value function approximation, helps in moving from a single isolated learning stage to cooperative exploration of the search space and accelerating learning speed. The performance of the proposed algorithm is compared with the standard SARSA algorithm and some of its properties are discussed. The experimental analysis confirms that the proposed approach has higher convergent speed with a negligible increase in computational complexity.

Keywords

computational complexity; evolutionary computation; function approximation; learning (artificial intelligence); SARSA algorithm; agent learning speed; computational complexity; evolutionary algorithms; evolutionary value function approximation; tabular reinforcement learning algorithms; Acceleration; Approximation methods; Optimization;

fLanguage

English

Publisher

ieee

Conference_Titel

Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on

Conference_Location

Paris

Print_ISBN

978-1-4244-9887-1

Type

conf

DOI

10.1109/ADPRL.2011.5967349

Filename

5967349