مرکز منطقه ای اطلاع رساني علوم و فناوري - Acquiring of walking behavior for four-legged robots using actor-critic method based on policy gradient

DocumentCode :

2717703

Title :

Acquiring of walking behavior for four-legged robots using actor-critic method based on policy gradient

Author :

Inoue, R. ; Watanabe, K. ; Igarashi, H.

Author_Institution :

Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan

fYear :

2010

fDate :

8-10 Sept. 2010

Firstpage :

795

Lastpage :

800

Abstract :

Actor-critic method based on policy gradient is an effective method for problems with high-dimensional learning space. In walking behavior of robots, the movement of robot´s joint is cyclic. For this purpose, we apply closed-loops of B-spline curve to cyclic movement. Hence, we propose a method combined actor-critic method based on policy gradient with B-spline curve. In this method, B-spline curve uses for controlling target such as joints in robot on continuous space. By using B-spline curve, the target for control can move smoothly and flexible. In this paper, we applied this method to a four-legged robot with 12 joints and simulated on the Open Dynamics Engine (ODE) which is a dynamics simulation software. The simulation results show that robots succeeded in getting walking behavior.

Keywords :

closed loop systems; control engineering computing; digital simulation; gradient methods; learning (artificial intelligence); legged locomotion; robot dynamics; splines (mathematics); B-spline curve; actor critic method; closed loops; cyclic movement; dynamics simulation software; four legged robots; high dimensional learning space; open dynamics engine; policy gradient; walking behavior; Joints; Learning; Legged locomotion; Markov processes; Robot kinematics; Spline;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Control (ISIC), 2010 IEEE International Symposium on

Conference_Location :

Yokohama

ISSN :

2158-9860

Print_ISBN :

978-1-4244-5360-3

Electronic_ISBN :

2158-9860

Type :

conf

DOI :

10.1109/ISIC.2010.5612891

Filename :

5612891

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2717703