Title :
A Novel Iterative
-Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
Author :
Qinglai Wei ; Derong Liu
Author_Institution :
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
Abstract :
This paper is concerned with a new iterative θ-adaptive dynamic programming (ADP) technique to solve optimal control problems of infinite horizon discrete-time nonlinear systems. The idea is to use an iterative ADP algorithm to obtain the iterative control law which optimizes the iterative performance index function. In the present iterative θ-ADP algorithm, the condition of initial admissible control in policy iteration algorithm is avoided. It is proved that all the iterative controls obtained in the iterative θ-ADP algorithm can stabilize the nonlinear system which means that the iterative θ-ADP algorithm is feasible for implementations both online and offline. Convergence analysis of the performance index function is presented to guarantee that the iterative performance index function will converge to the optimum monotonically. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative θ-ADP algorithm. Finally, two simulation examples are given to illustrate the performance of the established method.
Keywords :
convergence; discrete time systems; dynamic programming; infinite horizon; iterative methods; neurocontrollers; nonlinear control systems; optimal control; performance index; stability; convergence analysis; discrete-time nonlinear systems; infinite horizon discrete-time nonlinear systems; iterative control law; iterative performance index function; iterative-ADP algorithm; iterative-adaptive dynamic programming; neural networks; nonlinear system stabilization; optimal control problems; Dynamic programming; Learning (artificial intelligence); Neural networks; Nonlinear systems; Optimal control; Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming; neural networks; neuro-dynamic programming; nonlinear systems; optimal control; policy iteration; reinforcement learning; value iteration;
Journal_Title :
Automation Science and Engineering, IEEE Transactions on
DOI :
10.1109/TASE.2013.2280974