مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning Sensory Feedback to CPG with Policy Gradient for Biped Locomotion

DocumentCode :

2943918

Title :

Learning Sensory Feedback to CPG with Policy Gradient for Biped Locomotion

Author :

Matsubara, Takamitsu ; Morimoto, Jun ; Nakanishi, Jun ; Sato, Masa-aki ; Doya, Kenji

Author_Institution :

Nara Institute of Science and Technology, Ikoma, Nara, Japan; ATR Computational Neuroscience Laboratories, Kyoto, Japan; takam-m@atr.jp

fYear :

2005

fDate :

18-22 April 2005

Firstpage :

4164

Lastpage :

4169

Abstract :

This paper proposes a learning framework for a CPG-based biped locomotion controller using a policy gradient method. Our goal in this study is to develop an efficient learning algorithm by reducing the dimensionality of the state space used for learning. We demonstrate that an appropriate feedback controller in the CPG-based controller can be acquired using the proposed method within a few thousand trials by numerical simulations. Furthermore, we implement the learned controller on the physical biped robot to experimentally show that the learned controller successfully works in the real environment.

Keywords :

Reinforcement learning; biped locomotion; central pattern generator; policy gradient; Adaptive control; Feedback; Gradient methods; Hip; Learning; Legged locomotion; Orbital robotics; Oscillators; Robot kinematics; State-space methods; Reinforcement learning; biped locomotion; central pattern generator; policy gradient;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation, 2005. ICRA 2005. Proceedings of the 2005 IEEE International Conference on

Print_ISBN :

0-7803-8914-X

Type :

conf

DOI :

10.1109/ROBOT.2005.1570759

Filename :

1570759

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2943918