Title :
Learning Sensory Feedback to CPG with Policy Gradient for Biped Locomotion
Author :
Matsubara, Takamitsu ; Morimoto, Jun ; Nakanishi, Jun ; Sato, Masa-aki ; Doya, Kenji
Author_Institution :
Nara Institute of Science and Technology, Ikoma, Nara, Japan; ATR Computational Neuroscience Laboratories, Kyoto, Japan; takam-m@atr.jp
Abstract :
This paper proposes a learning framework for a CPG-based biped locomotion controller using a policy gradient method. Our goal in this study is to develop an efficient learning algorithm by reducing the dimensionality of the state space used for learning. We demonstrate that an appropriate feedback controller in the CPG-based controller can be acquired using the proposed method within a few thousand trials by numerical simulations. Furthermore, we implement the learned controller on the physical biped robot to experimentally show that the learned controller successfully works in the real environment.
Keywords :
Reinforcement learning; biped locomotion; central pattern generator; policy gradient; Adaptive control; Feedback; Gradient methods; Hip; Learning; Legged locomotion; Orbital robotics; Oscillators; Robot kinematics; State-space methods; Reinforcement learning; biped locomotion; central pattern generator; policy gradient;
Conference_Titel :
Robotics and Automation, 2005. ICRA 2005. Proceedings of the 2005 IEEE International Conference on
Print_ISBN :
0-7803-8914-X
DOI :
10.1109/ROBOT.2005.1570759