DocumentCode
26116
Title
A Novel Iterative
-Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
Author
Qinglai Wei ; Derong Liu
Author_Institution
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
Volume
11
Issue
4
fYear
2014
fDate
Oct. 2014
Firstpage
1176
Lastpage
1190
Abstract
This paper is concerned with a new iterative θ-adaptive dynamic programming (ADP) technique to solve optimal control problems of infinite horizon discrete-time nonlinear systems. The idea is to use an iterative ADP algorithm to obtain the iterative control law which optimizes the iterative performance index function. In the present iterative θ-ADP algorithm, the condition of initial admissible control in policy iteration algorithm is avoided. It is proved that all the iterative controls obtained in the iterative θ-ADP algorithm can stabilize the nonlinear system which means that the iterative θ-ADP algorithm is feasible for implementations both online and offline. Convergence analysis of the performance index function is presented to guarantee that the iterative performance index function will converge to the optimum monotonically. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative θ-ADP algorithm. Finally, two simulation examples are given to illustrate the performance of the established method.
Keywords
convergence; discrete time systems; dynamic programming; infinite horizon; iterative methods; neurocontrollers; nonlinear control systems; optimal control; performance index; stability; convergence analysis; discrete-time nonlinear systems; infinite horizon discrete-time nonlinear systems; iterative control law; iterative performance index function; iterative-ADP algorithm; iterative-adaptive dynamic programming; neural networks; nonlinear system stabilization; optimal control problems; Dynamic programming; Learning (artificial intelligence); Neural networks; Nonlinear systems; Optimal control; Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming; neural networks; neuro-dynamic programming; nonlinear systems; optimal control; policy iteration; reinforcement learning; value iteration;
fLanguage
English
Journal_Title
Automation Science and Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1545-5955
Type
jour
DOI
10.1109/TASE.2013.2280974
Filename
6609148
Link To Document