• DocumentCode
    3484829
  • Title

    A reinforcement learning algorithm for neural networks with incremental learning ability

  • Author

    Shiraga, Naoto ; Ozawa, Seiichi ; Abe, Shigeo

  • Author_Institution
    Graduate Sch. of Sci. & Technol., Kobe Univ., Japan
  • Volume
    5
  • fYear
    2002
  • fDate
    18-22 Nov. 2002
  • Firstpage
    2566
  • Abstract
    When neural networks are used for approximating action-values of Reinforcement Learning (RL) agents, the "interference" caused by incremental learning can be serious. To solve this problem, in this paper, a neural network model with incremental learning ability was applied to RL problems. In this model, correctly acquired input-output relations are stored into long-term memory, and the memorized data are effectively recalled in order to suppress the interference. In order to evaluate the incremental learning ability, the proposed model was applied to two problems: Extended Random-Walk Task and Extended Mountain-Car Task. In these tasks, the working space of agents is extended as the learning proceeds. In the simulations, we certified that the proposed model could acquire proper action-values as compared with the following three approaches to the approximation of action-value functions: tile coding, a conventional neural network model and the previously proposed neural network model.
  • Keywords
    learning (artificial intelligence); radial basis function networks; action-values; extended mountain-car task; extended random-walk task; incremental learning ability; input-output relations; interference suppression; long-term memory; neural networks; normalized radial basis functions; reinforcement learning algorithm; resource allocating network; Electronic mail; Function approximation; Interference; Learning; Neural networks; Resource management; State estimation; Table lookup; Tiles; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
  • Print_ISBN
    981-04-7524-1
  • Type

    conf

  • DOI
    10.1109/ICONIP.2002.1201958
  • Filename
    1201958