مرکز منطقه ای اطلاع رساني علوم و فناوري - An online actor-critic learning approach with Levenberg-Marquardt algorithm

DocumentCode :

3498598

Title :

An online actor-critic learning approach with Levenberg-Marquardt algorithm

Author :

Ni, Zhen ; He, Haibo ; Prokhorov, Danil V. ; Fu, Jian

Author_Institution :

Dept. of Electr., Comput. & Biomed. Eng., Univ. of Rhode Island, Kingston, RI, USA

fYear :

2011

fDate :

July 31 2011-Aug. 5 2011

Firstpage :

2333

Lastpage :

2340

Abstract :

This paper focuses on the efficiency improvement of online actor-critic design base on the Levenberg-Marquardt (LM) algorithm rather than traditional chain rule. Over the decades, several generations of adaptive/approximate dynamic programming (ADP) structures have been proposed in the community and demonstrated many successfully applications. Neural network with backpropagation has been one of the most important approaches to tune the parameters in such ADP designs. In this paper, we aim to study the integration of Levenberg-Marquardt method into the regular actor-critic design to improve weights updating and learning for a quadratic convergence under certain condition. Specifically, for the critic network design, we adopt the LM method targeting improved learning performance, while for the action network, we use the neural network with backpropagation to provide an appropriate control action. A detailed learning algorithm is presented, followed by benchmark tests of pendulum swing up and balance and cart-pole balance tasks. Various simulation results and comparative study demonstrated the effectiveness of this approach.

Keywords :

backpropagation; dynamic programming; neural nets; ADP; Levenberg-Marquardt algorithm; adaptive dynamic programming; approximate dynamic programming; backpropagation; cart-pole balance task; neural network; online actor-critic learning; pendulum swing up task; quadratic convergence; Algorithm design and analysis; Approximation algorithms; Artificial neural networks; Convergence; Equations; Jacobian matrices; Mathematical model;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), The 2011 International Joint Conference on

Conference_Location :

San Jose, CA

ISSN :

2161-4393

Print_ISBN :

978-1-4244-9635-8

Type :

conf

DOI :

10.1109/IJCNN.2011.6033520

Filename :

6033520

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3498598