Title :
Reinforcement learning approach to locomotion adaptation in sloped environments
Author :
Andre, Joao ; Costa, Luis ; Santos, Cristina
Author_Institution :
Dept. of Ind. Electron., Univ. of Minho, Braga, Portugal
Abstract :
In this work, Path Integral Policy Improvement with Covariance Matrix Adaptation (PI2-CMA) is implemented and used to address biped locomotion optimization. Central Pattern Generators (CPGs) and Dynamic Movement Primitives (DMPs) are combined to easily produce complex joint trajectories for simulated DARwIn-OP humanoid robot. PI2-CMA seeks optimal DMPs´ weights that maximize frontal velocity when facing different challenges. The simulation environments demand adaptation from the controller in order to successfully walk in different slopes. Elitism was introduced in PI2-CMA in order to improve the convergence property of the algorithm. Results show that these approaches enabled easy adaptation of DARwIn-OP to new situations. The results are very promising and demonstrate the flexibility at generating new trajectories for locomotion.
Keywords :
covariance matrices; humanoid robots; learning (artificial intelligence); legged locomotion; motion control; trajectory control; CPG; DARwIn-OP humanoid robot; DMP; PI2-CMA; biped locomotion optimization; central pattern generators; dynamic movement primitives; locomotion adaptation; path integral policy improvement with covariance matrix adaptation; reinforcement learning approach; sloped environment; trajectory generation; Convergence; Generators; Hip; Joints; Legged locomotion; Trajectory;
Conference_Titel :
Autonomous Robot Systems and Competitions (ICARSC), 2014 IEEE International Conference on
Conference_Location :
Espinho
DOI :
10.1109/ICARSC.2014.6849780