Title :
Application of reinforcement learning to balancing of Acrobot
Author :
Yoshimoto, Junichiro ; Ishii, Shin ; Sato, Masa-aki
Author_Institution :
Nara Inst. of Sci. & Technol., Ikoma, Japan
Abstract :
The Acrobot is a two-link robot, actuated only at the joint between the two links. It is one of difficult tasks in reinforcement learning (RL) to control the Acrobot because it has nonlinear dynamics and continuous state and action spaces. In this article, we discuss applying the RL to the task of balancing control of the Acrobot. Our RL method has an architecture similar to the actor-critic. The actor and the critic are approximated by normalized Gaussian networks, which are trained by an online EM algorithm. We also introduce eligibility traces for our actor-critic architecture. Our computer simulation shows that our method is able to achieve fairly good control with a small number of trials
Keywords :
function approximation; learning (artificial intelligence); neurocontrollers; nonlinear control systems; robot dynamics; state-space methods; Acrobot; Gaussian networks; NGnets; action space; actor-critic architecture; balancing control; function approximation; nonlinear dynamics; online EM algorithm; reinforcement learning; state space; two-link robot; Computer architecture; Computer simulation; Control systems; Equations; Humans; Information processing; Machine learning; Motion control; Orbital robotics; Robot motion;
Conference_Titel :
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location :
Tokyo
Print_ISBN :
0-7803-5731-0
DOI :
10.1109/ICSMC.1999.815605