DocumentCode :
1734880
Title :
A Temporal Difference GNG-Based Algorithm That Can Learn to Control in Reinforcement Learning Environments
Author :
Vieira, Davi C. L. ; Adeodato, Paulo J. L. ; Goncalves, Paulo M.
Author_Institution :
Inst. Fed. de Sergipe - IFS, Aracaju, Brazil
Volume :
1
fYear :
2013
Firstpage :
329
Lastpage :
332
Abstract :
This paper proposes a new reinforcement learning algorithm called TD-GNG that uses the Growing Neural Gas (GNG) network to deal with environments of large domains. The proposed algorithm is capable to reduce the dimensionality of the problem by aggregating similar states. In experimental comparison against tile-coding in mountain car and puddle world, the TD-GNG showed an increase in the generalization without loosing quality in the policy obtained.
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); neural nets; GNG network; TD-GNG; dimensionality reduction; generalization; growing neural gas network; reinforcement learning environments; temporal difference GNG-based algorithm; Convergence; Electronic mail; Equations; Learning (artificial intelligence); Mathematical model; Neural networks; Prediction algorithms; Adaptive State Space Partitioning; Growing Neural Gas; Reinforcement Learning; Temporal Difference;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2013 12th International Conference on
Conference_Location :
Miami, FL
Type :
conf
DOI :
10.1109/ICMLA.2013.67
Filename :
6784637
Link To Document :
بازگشت