DocumentCode
349973
Title
Application of reinforcement learning to balancing of Acrobot
Author
Yoshimoto, Junichiro ; Ishii, Shin ; Sato, Masa-aki
Author_Institution
Nara Inst. of Sci. & Technol., Ikoma, Japan
Volume
5
fYear
1999
fDate
1999
Firstpage
516
Abstract
The Acrobot is a two-link robot, actuated only at the joint between the two links. It is one of difficult tasks in reinforcement learning (RL) to control the Acrobot because it has nonlinear dynamics and continuous state and action spaces. In this article, we discuss applying the RL to the task of balancing control of the Acrobot. Our RL method has an architecture similar to the actor-critic. The actor and the critic are approximated by normalized Gaussian networks, which are trained by an online EM algorithm. We also introduce eligibility traces for our actor-critic architecture. Our computer simulation shows that our method is able to achieve fairly good control with a small number of trials
Keywords
function approximation; learning (artificial intelligence); neurocontrollers; nonlinear control systems; robot dynamics; state-space methods; Acrobot; Gaussian networks; NGnets; action space; actor-critic architecture; balancing control; function approximation; nonlinear dynamics; online EM algorithm; reinforcement learning; state space; two-link robot; Computer architecture; Computer simulation; Control systems; Equations; Humans; Information processing; Machine learning; Motion control; Orbital robotics; Robot motion;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location
Tokyo
ISSN
1062-922X
Print_ISBN
0-7803-5731-0
Type
conf
DOI
10.1109/ICSMC.1999.815605
Filename
815605
Link To Document