مرکز منطقه ای اطلاع رساني علوم و فناوري - A Temporal Difference GNG-Based Algorithm That Can Learn to Control in Reinforcement Learning Environments

DocumentCode :

1734880

Title :

A Temporal Difference GNG-Based Algorithm That Can Learn to Control in Reinforcement Learning Environments

Author :

Vieira, Davi C. L. ; Adeodato, Paulo J. L. ; Goncalves, Paulo M.

Author_Institution :

Inst. Fed. de Sergipe - IFS, Aracaju, Brazil

Volume :

fYear :

2013

Firstpage :

329

Lastpage :

332

Abstract :

This paper proposes a new reinforcement learning algorithm called TD-GNG that uses the Growing Neural Gas (GNG) network to deal with environments of large domains. The proposed algorithm is capable to reduce the dimensionality of the problem by aggregating similar states. In experimental comparison against tile-coding in mountain car and puddle world, the TD-GNG showed an increase in the generalization without loosing quality in the policy obtained.

Keywords :

generalisation (artificial intelligence); learning (artificial intelligence); neural nets; GNG network; TD-GNG; dimensionality reduction; generalization; growing neural gas network; reinforcement learning environments; temporal difference GNG-based algorithm; Convergence; Electronic mail; Equations; Learning (artificial intelligence); Mathematical model; Neural networks; Prediction algorithms; Adaptive State Space Partitioning; Growing Neural Gas; Reinforcement Learning; Temporal Difference;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning and Applications (ICMLA), 2013 12th International Conference on

Conference_Location :

Miami, FL

Type :

conf

DOI :

10.1109/ICMLA.2013.67

Filename :

6784637

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1734880