DocumentCode :
1831263
Title :
Convergence analysis of reinforcement learning approaches to humanoid locomotion
Author :
Tutsoy, O. ; Brown, Michael
Author_Institution :
Control Syst. Group, Univ. of Manchester, Manchester, UK
fYear :
2010
fDate :
7-10 Sept. 2010
Firstpage :
1
Lastpage :
6
Abstract :
Sophisticated intelligent machines such as humanoid robots require the ability to interact with the environment and hence efficiently adapt their behavior. Therefore, robots must be equipped with the ability to modify and add to its knowledge base using information gained from its past behaviour, such as stable, robust walking on unseen terrains. Currently, designing humanoid robots with advanced learning and cognitive capabilities is one of the most challenging issues in the field of intelligent robotics. The iCub and its newer version, the C-Cub, were developed as test beds for evaluating how cognitive and learning approaches can operate safely in unstructured environments. This paper describes preliminary work on evaluating the convergence of a variety of temporal difference learning algorithms, and comparing the results of each learning algorithm based on a simulation of a simple inverted pendulum in order to visualize the value and control action functions. It will be clearly showed that the learning performance of TD(λ) is significantly better than the TD(0) and stochastic gradient algorithm (SGA) based learning.
Keywords :
convergence; gradient methods; humanoid robots; intelligent robots; learning (artificial intelligence); nonlinear control systems; pendulums; stochastic processes; C-Cub; SGA based learning; convergence analysis; humanoid locomotion; humanoid robots; iCub; intelligent robotics; inverted pendulum; knowledge base systems; reinforcement learning; sophisticated intelligent machines; stochastic gradient algorithm; temporal difference learning; unseen terrains; unstructured environments; Humanoid Robot; Reinforcement Learning; Temporal Difference learning;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Control 2010, UKACC International Conference on
Conference_Location :
Coventry
Electronic_ISBN :
978-1-84600-038-6
Type :
conf
DOI :
10.1049/ic.2010.0439
Filename :
6490897
Link To Document :
بازگشت