DocumentCode :
3517317
Title :
Off-line path integral reinforcement learning using stochastic robot dynamics approximated by sparse pseudo-input Gaussian processes: Application to humanoid robot motor learning in the real environment
Author :
Sugimoto, Naozo ; Morimoto, Jun
Author_Institution :
Dept. of Center for Inf. & Neural Networks, NICT, Kyoto, Japan
fYear :
2013
fDate :
6-10 May 2013
Firstpage :
1311
Lastpage :
1316
Abstract :
We develop fast reinforcement learning (RL) framework using the approximated dynamics of a humanoid robot. Although RL is a useful non-linear optimizer, applying it to real robotic systems is usually difficult due to the large number of iterations required to acquire suitable policies. In this study, we approximate the dynamics using data from a real robot with sparse pseudo-input Gaussian processes (SPGPs). By using SPGPs, we estimated the probability distribution considering both the input vector and output signal variances. In real environments, since the observations from robotic sensors include large noise, SPGPs can suitably approximate the stochastic dynamics of a real humanoid robot. We use the approximated dynamics to improve the performance of a movement task in a path integral RL framework, which updates a policy from the sampled trajectories of the state and action vectors and the cost. We implemented our proposed method on a real humanoid robot and tested on a via-point reaching task. The robot achieved successful performance with fewer number of interactions with the real environment by using the proposed method than a conventional approach which dose not use the simulated dynamics.
Keywords :
Gaussian processes; humanoid robots; learning (artificial intelligence); robot dynamics; statistical distributions; RL framework; SPGP; action vectors; fast reinforcement learning; humanoid robot motor learning; input vector; off-line path integral reinforcement learning; output signal variances; probability distribution; sparse pseudo-input Gaussian processes; state vectors; stochastic robot dynamics; via-point reaching task; Approximation methods; Gaussian processes; Humanoid robots; Joints; Trajectory; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation (ICRA), 2013 IEEE International Conference on
Conference_Location :
Karlsruhe
ISSN :
1050-4729
Print_ISBN :
978-1-4673-5641-1
Type :
conf
DOI :
10.1109/ICRA.2013.6630740
Filename :
6630740
Link To Document :
بازگشت