• DocumentCode
    2717703
  • Title

    Acquiring of walking behavior for four-legged robots using actor-critic method based on policy gradient

  • Author

    Inoue, R. ; Watanabe, K. ; Igarashi, H.

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan
  • fYear
    2010
  • fDate
    8-10 Sept. 2010
  • Firstpage
    795
  • Lastpage
    800
  • Abstract
    Actor-critic method based on policy gradient is an effective method for problems with high-dimensional learning space. In walking behavior of robots, the movement of robot´s joint is cyclic. For this purpose, we apply closed-loops of B-spline curve to cyclic movement. Hence, we propose a method combined actor-critic method based on policy gradient with B-spline curve. In this method, B-spline curve uses for controlling target such as joints in robot on continuous space. By using B-spline curve, the target for control can move smoothly and flexible. In this paper, we applied this method to a four-legged robot with 12 joints and simulated on the Open Dynamics Engine (ODE) which is a dynamics simulation software. The simulation results show that robots succeeded in getting walking behavior.
  • Keywords
    closed loop systems; control engineering computing; digital simulation; gradient methods; learning (artificial intelligence); legged locomotion; robot dynamics; splines (mathematics); B-spline curve; actor critic method; closed loops; cyclic movement; dynamics simulation software; four legged robots; high dimensional learning space; open dynamics engine; policy gradient; walking behavior; Joints; Learning; Legged locomotion; Markov processes; Robot kinematics; Spline;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Control (ISIC), 2010 IEEE International Symposium on
  • Conference_Location
    Yokohama
  • ISSN
    2158-9860
  • Print_ISBN
    978-1-4244-5360-3
  • Electronic_ISBN
    2158-9860
  • Type

    conf

  • DOI
    10.1109/ISIC.2010.5612891
  • Filename
    5612891