Hyper-cubic discretization for TD learning based on autonomous decentralized approach

Author

Kobayashi, Yoshiyuki ; Hosoe, Shigeyuki

Author_Institution

Bio-mimetic Control Res. Center, RIKEN, Nagoya, Japan

Volume

4

fYear

2003

fDate

5-8 Oct. 2003

Firstpage

3633

Abstract

Adaptive resolution of function approximator is known to be important when we apply reinforcement learning to unknown problems. We propose to apply successive division and integration scheme of function approximation to temporal difference learning based on local curvature. TD learning in continuous state-space is based on non-constant values function approximation, which requires the simplicity of function approximator representation. We define bases and local complexity of function approximator in the similar way to the autonomous decentralized function approximation, but they are much simpler. The simplicity of approximator element bring us much less computation and easier analysis. The proposed function approximator is proven to be effective through function approximation problem and a reinforcement learning standard problem, pendulum swing-up task.

Keywords

function approximation; learning (artificial intelligence); multivariable systems; state-space methods; adaptation algorithm; approximator element; autonomous decentralized approach; function approximation; function approximator; hypercubic discretization; local curvature; pendulum swing-up task; reinforcement learning; state-space methods; temporal difference learning; Adaptive control; Algorithm design and analysis; Approximation algorithms; Force control; Function approximation; Learning; Programmable control; Radial basis function networks; Shape; State-space methods;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man and Cybernetics, 2003. IEEE International Conference on

ISSN

1062-922X

Print_ISBN

0-7803-7952-7

Type

conf

DOI

10.1109/ICSMC.2003.1244453

Filename

1244453