DocumentCode :
3090688
Title :
Adaptive function approximation in reinforcement learning with an interpolating growing neural gas
Author :
Baumann, Martin ; Buning, Hans Kleine
Author_Institution :
Int. Grad. Sch. of Dynamic Intell. Syst., Univ. of Paderborn, Paderborn, Germany
fYear :
2012
fDate :
4-7 Dec. 2012
Firstpage :
512
Lastpage :
517
Abstract :
Q-Learning is a widely used method for dealing with reinforcement learning problems. To speed up learning and to exploit gained experience more efficiently it is highly beneficial to add generalization to Q-Learning and thus enabling the transfer of experience to unseen but similar states. In this paper, we report on improvements for GNG-Q, a combination of Q-Learning and growing neural gas (GNG). It solves reinforcement learning problems with continuous state spaces and simultaneously learns a proper approximation of the state space by starting with a coarse resolution that is gradually refined based on information achieved during learning. We introduce the Interpolating GNG-Q (IGNG-Q) that uses distance-based interpolation between learned Q-vectors, adjust the update rule, suggest a new refinement strategy and propose a new criterion to decide when a refinement is necessary. Furthermore, we argue that this criterion offers an implicit local stopping condition for changes made to the approximation. Additionally, we employ eligibility traces to speed up learning. The improved method is evaluated in continuous state spaces and the results are compared with several approaches from literature. Our experiments confirm that the modifications highly improve the efficiency of the approximation and that IGNG-Q is well competitive with existing methods.
Keywords :
function approximation; interpolation; learning (artificial intelligence); neural nets; GNG-Q; IGNG-Q; Q-learning; adaptive function approximation; continuous state spaces; distance-based interpolation; eligibility traces; implicit local stopping condition; interpolating growing neural gas; learned Q-vectors; refinement strategy; reinforcement learning; update rule adjustment; Function approximation; Interpolation; Learning; Neurons; Prototypes; Vectors; Continuous Reinforcement Learning; Function Approximation; Growing Neural Gas;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Hybrid Intelligent Systems (HIS), 2012 12th International Conference on
Conference_Location :
Pune
Print_ISBN :
978-1-4673-5114-0
Type :
conf
DOI :
10.1109/HIS.2012.6421387
Filename :
6421387
Link To Document :
بازگشت