DocumentCode :
20705
Title :
Online approximate optimal control for affine non-linear systems with unknown internal dynamics using adaptive dynamic programming
Author :
Xiong Yang ; Derong Liu ; Qinglai Wei
Author_Institution :
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
Volume :
8
Issue :
16
fYear :
2014
fDate :
11 6 2014
Firstpage :
1676
Lastpage :
1688
Abstract :
In this study, a novel online adaptive dynamic programming (ADP)-based algorithm is developed for solving the optimal control problem of affine non-linear continuous-time systems with unknown internal dynamics. The present algorithm employs an observer-critic architecture to approximate the Hamilton-Jacobi-Bellman equation. Two neural networks (NNs) are used in this architecture: an NN state observer is constructed to estimate the unknown system dynamics, and a critic NN is designed to derive the optimal control, replacing the action-critic dual networks typically employed in traditional ADP algorithms. Based on the developed architecture, the observer NN and the critic NN are tuned simultaneously. Meanwhile, unlike existing tuning laws for the critic, the newly developed critic update rule not only ensures convergence of the critic to the optimal control but also guarantees stability of the closed-loop system. No initial stabilising control is required, and by using recorded and instantaneous data simultaneously for the adaptation of the critic, the restrictive persistence of excitation condition is relaxed. In addition, the Lyapunov direct method is utilised to demonstrate the uniform ultimate boundedness of the weights of the observer NN and the critic NN. Finally, an example is provided to verify the effectiveness of the present approach.
Keywords :
Lyapunov methods; closed loop systems; continuous time systems; dynamic programming; neurocontrollers; nonlinear control systems; observers; optimal control; stability; ADP-based algorithm; Hamilton-Jacobi-Bellman equation; Lyapunov direct method; NN state observer; action-critic dual networks; adaptive dynamic programming; affine nonlinear continuous-time systems; closed-loop system; critic update rule; excitation condition; instantaneous data; neural networks; observer-critic architecture; online approximate optimal control; recorded data; stability; uniform ultimate boundedness; unknown internal dynamics;
fLanguage :
English
Journal_Title :
Control Theory & Applications, IET
Publisher :
IET
ISSN :
1751-8644
Type :
jour
DOI :
10.1049/iet-cta.2014.0186
Filename :
6941355