DocumentCode
2771947
Title
Adaptive dynamic programming with stable value iteration algorithm for discrete-time nonlinear systems
Author
Wei, Qinglai ; Liu, Derong
Author_Institution
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
fYear
2012
fDate
10-15 June 2012
Firstpage
1
Lastpage
6
Abstract
In this paper, a new stable value iteration adaptive dynamic programming (ADP) algorithm, named “θ-ADP” algorithm, is proposed for solving the optimal control problems of infinite horizon discrete-time nonlinear systems. By introducing a parameter θ in the iterative ADP algorithm, it is proved that any of iterative control obtained in the proposed algorithm can stabilize the nonlinear system which overcomes the disadvantage of traditional value iteration algorithms. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative θ-ADP algorithm. Finally, a simulation example is given to illustrate the performance of the proposed method.
Keywords
adaptive control; approximation theory; discrete time systems; dynamic programming; infinite horizon; iterative methods; neurocontrollers; nonlinear control systems; optimal control; stability; adaptive dynamic programming algorithm; infinite horizon discrete-time nonlinear systems; iterative θ-ADP algorithm; iterative ADP algorithm; iterative control; neural networks; optimal control problems; performance index function approximation; stable value iteration algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location
Brisbane, QLD
ISSN
2161-4393
Print_ISBN
978-1-4673-1488-6
Electronic_ISBN
2161-4393
Type
conf
DOI
10.1109/IJCNN.2012.6252512
Filename
6252512
Link To Document