• DocumentCode
    349973
  • Title

    Application of reinforcement learning to balancing of Acrobot

  • Author

    Yoshimoto, Junichiro ; Ishii, Shin ; Sato, Masa-aki

  • Author_Institution
    Nara Inst. of Sci. & Technol., Ikoma, Japan
  • Volume
    5
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    516
  • Abstract
    The Acrobot is a two-link robot, actuated only at the joint between the two links. It is one of difficult tasks in reinforcement learning (RL) to control the Acrobot because it has nonlinear dynamics and continuous state and action spaces. In this article, we discuss applying the RL to the task of balancing control of the Acrobot. Our RL method has an architecture similar to the actor-critic. The actor and the critic are approximated by normalized Gaussian networks, which are trained by an online EM algorithm. We also introduce eligibility traces for our actor-critic architecture. Our computer simulation shows that our method is able to achieve fairly good control with a small number of trials
  • Keywords
    function approximation; learning (artificial intelligence); neurocontrollers; nonlinear control systems; robot dynamics; state-space methods; Acrobot; Gaussian networks; NGnets; action space; actor-critic architecture; balancing control; function approximation; nonlinear dynamics; online EM algorithm; reinforcement learning; state space; two-link robot; Computer architecture; Computer simulation; Control systems; Equations; Humans; Information processing; Machine learning; Motion control; Orbital robotics; Robot motion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
  • Conference_Location
    Tokyo
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-5731-0
  • Type

    conf

  • DOI
    10.1109/ICSMC.1999.815605
  • Filename
    815605