DocumentCode :
2943918
Title :
Learning Sensory Feedback to CPG with Policy Gradient for Biped Locomotion
Author :
Matsubara, Takamitsu ; Morimoto, Jun ; Nakanishi, Jun ; Sato, Masa-aki ; Doya, Kenji
Author_Institution :
Nara Institute of Science and Technology, Ikoma, Nara, Japan; ATR Computational Neuroscience Laboratories, Kyoto, Japan; takam-m@atr.jp
fYear :
2005
fDate :
18-22 April 2005
Firstpage :
4164
Lastpage :
4169
Abstract :
This paper proposes a learning framework for a CPG-based biped locomotion controller using a policy gradient method. Our goal in this study is to develop an efficient learning algorithm by reducing the dimensionality of the state space used for learning. We demonstrate that an appropriate feedback controller in the CPG-based controller can be acquired using the proposed method within a few thousand trials by numerical simulations. Furthermore, we implement the learned controller on the physical biped robot to experimentally show that the learned controller successfully works in the real environment.
Keywords :
Reinforcement learning; biped locomotion; central pattern generator; policy gradient; Adaptive control; Feedback; Gradient methods; Hip; Learning; Legged locomotion; Orbital robotics; Oscillators; Robot kinematics; State-space methods; Reinforcement learning; biped locomotion; central pattern generator; policy gradient;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation, 2005. ICRA 2005. Proceedings of the 2005 IEEE International Conference on
Print_ISBN :
0-7803-8914-X
Type :
conf
DOI :
10.1109/ROBOT.2005.1570759
Filename :
1570759
Link To Document :
بازگشت