مرکز منطقه ای اطلاع رساني علوم و فناوري - A Novel Iterative <formula formulatype="inline"> <img src="/images/tex/16594.gif" alt="\\theta "> </formula>-Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems

DocumentCode :

26116

Title :

A Novel Iterative $\\theta$ -Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems

Author :

Qinglai Wei ; Derong Liu

Author_Institution :

State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China

Volume :

Issue :

fYear :

2014

fDate :

Oct. 2014

Firstpage :

1176

Lastpage :

1190

Abstract :

This paper is concerned with a new iterative θ-adaptive dynamic programming (ADP) technique to solve optimal control problems of infinite horizon discrete-time nonlinear systems. The idea is to use an iterative ADP algorithm to obtain the iterative control law which optimizes the iterative performance index function. In the present iterative θ-ADP algorithm, the condition of initial admissible control in policy iteration algorithm is avoided. It is proved that all the iterative controls obtained in the iterative θ-ADP algorithm can stabilize the nonlinear system which means that the iterative θ-ADP algorithm is feasible for implementations both online and offline. Convergence analysis of the performance index function is presented to guarantee that the iterative performance index function will converge to the optimum monotonically. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative θ-ADP algorithm. Finally, two simulation examples are given to illustrate the performance of the established method.

Keywords :

convergence; discrete time systems; dynamic programming; infinite horizon; iterative methods; neurocontrollers; nonlinear control systems; optimal control; performance index; stability; convergence analysis; discrete-time nonlinear systems; infinite horizon discrete-time nonlinear systems; iterative control law; iterative performance index function; iterative-ADP algorithm; iterative-adaptive dynamic programming; neural networks; nonlinear system stabilization; optimal control problems; Dynamic programming; Learning (artificial intelligence); Neural networks; Nonlinear systems; Optimal control; Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming; neural networks; neuro-dynamic programming; nonlinear systems; optimal control; policy iteration; reinforcement learning; value iteration;

fLanguage :

English

Journal_Title :

Automation Science and Engineering, IEEE Transactions on

Publisher :

ieee

ISSN :

1545-5955

Type :

jour

DOI :

10.1109/TASE.2013.2280974

Filename :

6609148

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=26116