مرکز منطقه ای اطلاع رساني علوم و فناوري - Application of reinforcement learning to balancing of Acrobot

DocumentCode :

349973

Title :

Application of reinforcement learning to balancing of Acrobot

Author :

Yoshimoto, Junichiro ; Ishii, Shin ; Sato, Masa-aki

Author_Institution :

Nara Inst. of Sci. & Technol., Ikoma, Japan

Volume :

fYear :

1999

fDate :

1999

Firstpage :

516

Abstract :

The Acrobot is a two-link robot, actuated only at the joint between the two links. It is one of difficult tasks in reinforcement learning (RL) to control the Acrobot because it has nonlinear dynamics and continuous state and action spaces. In this article, we discuss applying the RL to the task of balancing control of the Acrobot. Our RL method has an architecture similar to the actor-critic. The actor and the critic are approximated by normalized Gaussian networks, which are trained by an online EM algorithm. We also introduce eligibility traces for our actor-critic architecture. Our computer simulation shows that our method is able to achieve fairly good control with a small number of trials

Keywords :

function approximation; learning (artificial intelligence); neurocontrollers; nonlinear control systems; robot dynamics; state-space methods; Acrobot; Gaussian networks; NGnets; action space; actor-critic architecture; balancing control; function approximation; nonlinear dynamics; online EM algorithm; reinforcement learning; state space; two-link robot; Computer architecture; Computer simulation; Control systems; Equations; Humans; Information processing; Machine learning; Motion control; Orbital robotics; Robot motion;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on

Conference_Location :

Tokyo

ISSN :

1062-922X

Print_ISBN :

0-7803-5731-0

Type :

conf

DOI :

10.1109/ICSMC.1999.815605

Filename :

815605

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=349973