DocumentCode :
2717703
Title :
Acquiring of walking behavior for four-legged robots using actor-critic method based on policy gradient
Author :
Inoue, R. ; Watanabe, K. ; Igarashi, H.
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan
fYear :
2010
fDate :
8-10 Sept. 2010
Firstpage :
795
Lastpage :
800
Abstract :
Actor-critic method based on policy gradient is an effective method for problems with high-dimensional learning space. In walking behavior of robots, the movement of robot´s joint is cyclic. For this purpose, we apply closed-loops of B-spline curve to cyclic movement. Hence, we propose a method combined actor-critic method based on policy gradient with B-spline curve. In this method, B-spline curve uses for controlling target such as joints in robot on continuous space. By using B-spline curve, the target for control can move smoothly and flexible. In this paper, we applied this method to a four-legged robot with 12 joints and simulated on the Open Dynamics Engine (ODE) which is a dynamics simulation software. The simulation results show that robots succeeded in getting walking behavior.
Keywords :
closed loop systems; control engineering computing; digital simulation; gradient methods; learning (artificial intelligence); legged locomotion; robot dynamics; splines (mathematics); B-spline curve; actor critic method; closed loops; cyclic movement; dynamics simulation software; four legged robots; high dimensional learning space; open dynamics engine; policy gradient; walking behavior; Joints; Learning; Legged locomotion; Markov processes; Robot kinematics; Spline;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control (ISIC), 2010 IEEE International Symposium on
Conference_Location :
Yokohama
ISSN :
2158-9860
Print_ISBN :
978-1-4244-5360-3
Electronic_ISBN :
2158-9860
Type :
conf
DOI :
10.1109/ISIC.2010.5612891
Filename :
5612891
Link To Document :
بازگشت