Title :
Learning to Control a 6-Degree-of-Freedom Walking Robot
Author :
Pawel Wawrzynski
Author_Institution :
Warsaw University of Technology, Institute of Control and Computation Engineering, Warsaw, Poland. Email: p.wawrzynski@elka.pw.edu.pl
Abstract :
We analyze the problem of optimizing a control policy for a complex system through simulated trial-and-error learning. The approach we consider is reinforcement learning (RL). Stationary policies, which most RL methods apply, may be unsuitable in control applications: when the time discretization is sufficiently fine, they exhibit no exploration capability and define policy gradient estimators of very large variance. As a remedy for these difficulties, we previously proposed the use of piecewise non-Markov policies. In the experimental study presented here, we apply this approach to a 6-degree-of-freedom walking robot and obtain an efficient policy for it.
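The exploration claim in the abstract can be illustrated with a small numerical sketch. It is not the paper's algorithm or robot model: it assumes a toy system whose state is simply the time integral of the action noise, and the function name `displacement_std` and all parameters are hypothetical. It compares noise redrawn at every time step (as a stationary stochastic policy would do) with noise held constant over pieces of fixed duration, which is one simple way to picture a piecewise policy.

```python
import numpy as np


def displacement_std(dt, horizon=1.0, sigma=1.0, hold=None, trials=2000, seed=0):
    """Estimate the std of integrated action noise over `horizon` seconds.

    hold=None : independent Gaussian noise at every step of length dt.
    hold=h    : noise is redrawn every h seconds and held constant in between.
    All names and defaults are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    steps = int(round(horizon / dt))
    if hold is None:
        # Fresh noise at each step: successive perturbations tend to cancel.
        noise = rng.normal(0.0, sigma, size=(trials, steps))
    else:
        per_piece = max(1, int(round(hold / dt)))
        pieces = -(-steps // per_piece)  # ceiling division
        draws = rng.normal(0.0, sigma, size=(trials, pieces))
        # Repeat each draw so it stays fixed for the whole piece.
        noise = np.repeat(draws, per_piece, axis=1)[:, :steps]
    # Toy dynamics (assumption): the state is the integral of the noise.
    displacement = noise.sum(axis=1) * dt
    return displacement.std()


if __name__ == "__main__":
    for dt in (0.1, 0.01, 0.001):
        per_step = displacement_std(dt)
        held = displacement_std(dt, hold=0.2)
        print(f"dt={dt}: per-step noise {per_step:.3f}, held noise {held:.3f}")
```

In this toy model the per-step spread scales as sigma * sqrt(horizon * dt), so it vanishes as the discretization gets finer, while the held-noise spread depends only on the piece duration and stays roughly constant.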
Keywords :
"Legged locomotion","Control systems","Optimal control","Control engineering computing","Computational modeling","Analytical models","Computer simulation","Machine learning","Adaptive control","Intelligent agent"
Conference_Title :
EUROCON, 2007. The International Conference on "Computer as a Tool"
Print_ISBN :
978-1-4244-0812-2
DOI :
10.1109/EURCON.2007.4400335