DocumentCode :
3498598
Title :
An online actor-critic learning approach with Levenberg-Marquardt algorithm
Author :
Ni, Zhen ; He, Haibo ; Prokhorov, Danil V. ; Fu, Jian
Author_Institution :
Dept. of Electr., Comput. & Biomed. Eng., Univ. of Rhode Island, Kingston, RI, USA
fYear :
2011
fDate :
July 31 2011-Aug. 5 2011
Firstpage :
2333
Lastpage :
2340
Abstract :
This paper focuses on the efficiency improvement of online actor-critic design base on the Levenberg-Marquardt (LM) algorithm rather than traditional chain rule. Over the decades, several generations of adaptive/approximate dynamic programming (ADP) structures have been proposed in the community and demonstrated many successfully applications. Neural network with backpropagation has been one of the most important approaches to tune the parameters in such ADP designs. In this paper, we aim to study the integration of Levenberg-Marquardt method into the regular actor-critic design to improve weights updating and learning for a quadratic convergence under certain condition. Specifically, for the critic network design, we adopt the LM method targeting improved learning performance, while for the action network, we use the neural network with backpropagation to provide an appropriate control action. A detailed learning algorithm is presented, followed by benchmark tests of pendulum swing up and balance and cart-pole balance tasks. Various simulation results and comparative study demonstrated the effectiveness of this approach.
Keywords :
backpropagation; dynamic programming; neural nets; ADP; Levenberg-Marquardt algorithm; adaptive dynamic programming; approximate dynamic programming; backpropagation; cart-pole balance task; neural network; online actor-critic learning; pendulum swing up task; quadratic convergence; Algorithm design and analysis; Approximation algorithms; Artificial neural networks; Convergence; Equations; Jacobian matrices; Mathematical model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2011 International Joint Conference on
Conference_Location :
San Jose, CA
ISSN :
2161-4393
Print_ISBN :
978-1-4244-9635-8
Type :
conf
DOI :
10.1109/IJCNN.2011.6033520
Filename :
6033520
Link To Document :
بازگشت