• DocumentCode
    3228253
  • Title

    A Temporal Difference GNG-Based Approach for the State Space Quantization in Reinforcement Learning Environments

  • Author

    Vieira, Davi C. L. ; Adeodato, Paulo J. L. ; Goncalves, Paulo M.

  • Author_Institution
    Inst. Fed. de Sergipe, Aracaju, Brazil
  • fYear
    2013
  • fDate
    4-6 Nov. 2013
  • Firstpage
    561
  • Lastpage
    568
  • Abstract
    The main issue when using reinforcement learning algorithms is how the estimation of the value function can be mapped into states. In very few cases it is possible to use tables but in the majority of cases, the number of states either can be too large to be kept into computer memory or it is computationally too expensive to visit all states. State aggregation models like the self-organizing maps have been used to make this possible by generalizing the input space and mapping the value functions into the states. This paper proposes a new algorithm called TD-GNG that uses the Growing Neural Gas (GNG) network to solve reinforcement learning problems by providing a way to map value functions into states. In experimental comparison against TD-AVQ and uniform discretization in three reinforcement problems, the TD-GNG showed improvements in three aspects, namely, 1) reduction of the dimensionality of the problem, 2) increase the generalization and 3) reduction of the convergence time. Experiments have also show that TD-GNG found a solution using less memory than TD-AVQ and uniform discretization without loosing quality in the policy obtained.
  • Keywords
    generalisation (artificial intelligence); learning (artificial intelligence); neural nets; TD-AVQ algorithm; TD-GNG algorithm; convergence time reduction; dimensionality reduction; generalization; growing neural gas; reinforcement learning environments; self-organizing maps; state aggregation models; state mapping; state space quantization; temporal difference GNG-based approach; uniform discretization; value function; Convergence; Electronic mail; Equations; Estimation; Learning (artificial intelligence); Mathematical model; Tiles; Adaptive State Space Partitioning; Growing Neural Gas; Q-Learning; Reinforcement Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2013 IEEE 25th International Conference on
  • Conference_Location
    Herndon, VA
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4799-2971-9
  • Type

    conf

  • DOI
    10.1109/ICTAI.2013.89
  • Filename
    6735300