• DocumentCode
    397950
  • Title

    Hyper-cubic discretization for TD learning based on autonomous decentralized approach

  • Author

    Kobayashi, Yoshiyuki ; Hosoe, Shigeyuki

  • Author_Institution
    Bio-mimetic Control Res. Center, RIKEN, Nagoya, Japan
  • Volume
    4
  • fYear
    2003
  • fDate
    5-8 Oct. 2003
  • Firstpage
    3633
  • Abstract
    Adaptive resolution of function approximator is known to be important when we apply reinforcement learning to unknown problems. We propose to apply successive division and integration scheme of function approximation to temporal difference learning based on local curvature. TD learning in continuous state-space is based on non-constant values function approximation, which requires the simplicity of function approximator representation. We define bases and local complexity of function approximator in the similar way to the autonomous decentralized function approximation, but they are much simpler. The simplicity of approximator element bring us much less computation and easier analysis. The proposed function approximator is proven to be effective through function approximation problem and a reinforcement learning standard problem, pendulum swing-up task.
  • Keywords
    function approximation; learning (artificial intelligence); multivariable systems; state-space methods; adaptation algorithm; approximator element; autonomous decentralized approach; function approximation; function approximator; hypercubic discretization; local curvature; pendulum swing-up task; reinforcement learning; state-space methods; temporal difference learning; Adaptive control; Algorithm design and analysis; Approximation algorithms; Force control; Function approximation; Learning; Programmable control; Radial basis function networks; Shape; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2003. IEEE International Conference on
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-7952-7
  • Type

    conf

  • DOI
    10.1109/ICSMC.2003.1244453
  • Filename
    1244453