DocumentCode :
1761588
Title :
Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints
Author :
Xiong Yang ; Derong Liu ; Huang, Yuzhu
Author_Institution :
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
Volume :
7
Issue :
17
fYear :
2013
fDate :
November 21 2013
Firstpage :
2037
Lastpage :
2047
Abstract :
In this study, an online adaptive optimal control scheme is developed for solving the infinite-horizon optimal control problem of uncertain non-linear continuous-time systems with the control policy having saturation constraints. A novel identifier-critic architecture is presented to approximate the Hamilton-Jacobi-Bellman equation using two neural networks (NNs): an identifier NN is used to estimate the uncertain system dynamics and a critic NN is utilised to derive the optimal control instead of typical action-critic dual networks employed in reinforcement learning. Based on the developed architecture, the identifier NN and the critic NN are tuned simultaneously. Meanwhile, unlike initial stabilising control indispensable in policy iteration, there is no special requirement imposed on the initial control. Moreover, by using Lyapunov´s direct method, the weights of the identifier NN and the critic NN are guaranteed to be uniformly ultimately bounded, while keeping the closed-loop system stable. Finally, an example is provided to demonstrate the effectiveness of the present approach.
Keywords :
Lyapunov methods; adaptive control; approximation theory; closed loop systems; continuous time systems; neurocontrollers; nonlinear control systems; optimal control; robust control; uncertain systems; Hamilton-Jacobi-Bellman equation approximation; Lyapunov´s direct method; action-critic dual networks; closed loop system stability; control constraints; control policy; critic NN; identifier NN; identifier-critic architecture; infinite-horizon optimal control problem; neural network-based online adaptive optimal control; policy iteration; reinforcement learning; saturation constraints; uncertain nonlinear continuous-time systems; uncertain system dynamics;
fLanguage :
English
Journal_Title :
Control Theory & Applications, IET
Publisher :
iet
ISSN :
1751-8644
Type :
jour
DOI :
10.1049/iet-cta.2013.0472
Filename :
6668837
Link To Document :
بازگشت