DocumentCode :
3572793
Title :
Optimal learning control for discrete-time nonlinear systems using generalized policy iteration based adaptive dynamic programming
Author :
Qinglai Wei ; Derong Liu
Author_Institution :
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
fYear :
2014
Firstpage :
1781
Lastpage :
1786
Abstract :
In this paper, a novel generalized policy iteration algorithm is investigated to solve infinite horizon optimal control problems for discrete-time nonlinear systems. Two iteration indices are introduced in the generalized policy iteration algorithm, which iterate for policy improvement and policy evaluation, respectively. For the first time the properties of monotonicity, convergence and admissibility for the generalized policy iteration algorithm are analyzed to guarantee that the iterative performance index function converges to the optimum and the iterative control law stabilizes the control system. Finally, numerical results are presented to illustrate the performance of the developed method.
Keywords :
discrete time systems; dynamic programming; iterative methods; nonlinear control systems; optimal control; adaptive dynamic programming; discrete-time nonlinear systems; generalized policy iteration algorithm; infinite horizon optimal control problems; iteration indices; iterative control law; iterative performance index function; optimal learning control; policy evaluation; policy improvement; Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming; generalized policy iteration; neural networks; neuro-dynamic programming; nonlinear systems; optimal control; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control and Automation (WCICA), 2014 11th World Congress on
Type :
conf
DOI :
10.1109/WCICA.2014.7052990
Filename :
7052990
Link To Document :
بازگشت