Adaptive function approximation in reinforcement learning with an interpolating growing neural gas

Author

Baumann, Martin ; Buning, Hans Kleine

Author_Institution

Int. Grad. Sch. of Dynamic Intell. Syst., Univ. of Paderborn, Paderborn, Germany

fYear

2012

fDate

4-7 Dec. 2012

Firstpage

512

Lastpage

517

Abstract

Q-Learning is a widely used method for dealing with reinforcement learning problems. To speed up learning and to exploit gained experience more efficiently it is highly beneficial to add generalization to Q-Learning and thus enabling the transfer of experience to unseen but similar states. In this paper, we report on improvements for GNG-Q, a combination of Q-Learning and growing neural gas (GNG). It solves reinforcement learning problems with continuous state spaces and simultaneously learns a proper approximation of the state space by starting with a coarse resolution that is gradually refined based on information achieved during learning. We introduce the Interpolating GNG-Q (IGNG-Q) that uses distance-based interpolation between learned Q-vectors, adjust the update rule, suggest a new refinement strategy and propose a new criterion to decide when a refinement is necessary. Furthermore, we argue that this criterion offers an implicit local stopping condition for changes made to the approximation. Additionally, we employ eligibility traces to speed up learning. The improved method is evaluated in continuous state spaces and the results are compared with several approaches from literature. Our experiments confirm that the modifications highly improve the efficiency of the approximation and that IGNG-Q is well competitive with existing methods.

Keywords

function approximation; interpolation; learning (artificial intelligence); neural nets; GNG-Q; IGNG-Q; Q-learning; adaptive function approximation; continuous state spaces; distance-based interpolation; eligibility traces; implicit local stopping condition; interpolating growing neural gas; learned Q-vectors; refinement strategy; reinforcement learning; update rule adjustment; Function approximation; Interpolation; Learning; Neurons; Prototypes; Vectors; Continuous Reinforcement Learning; Function Approximation; Growing Neural Gas;

fLanguage

English

Publisher

ieee

Conference_Titel

Hybrid Intelligent Systems (HIS), 2012 12th International Conference on

Conference_Location

Pune

Print_ISBN

978-1-4673-5114-0

Type

conf

DOI

10.1109/HIS.2012.6421387

Filename

6421387