مرکز منطقه ای اطلاع رساني علوم و فناوري - Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints

DocumentCode :

1761588

Title :

Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints

Author :

Xiong Yang ; Derong Liu ; Huang, Yuzhu

Author_Institution :

State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China

Volume :

Issue :

fYear :

2013

fDate :

November 21 2013

Firstpage :

2037

Lastpage :

2047

Abstract :

In this study, an online adaptive optimal control scheme is developed for solving the infinite-horizon optimal control problem of uncertain non-linear continuous-time systems with the control policy having saturation constraints. A novel identifier-critic architecture is presented to approximate the Hamilton-Jacobi-Bellman equation using two neural networks (NNs): an identifier NN is used to estimate the uncertain system dynamics and a critic NN is utilised to derive the optimal control instead of typical action-critic dual networks employed in reinforcement learning. Based on the developed architecture, the identifier NN and the critic NN are tuned simultaneously. Meanwhile, unlike initial stabilising control indispensable in policy iteration, there is no special requirement imposed on the initial control. Moreover, by using Lyapunov´s direct method, the weights of the identifier NN and the critic NN are guaranteed to be uniformly ultimately bounded, while keeping the closed-loop system stable. Finally, an example is provided to demonstrate the effectiveness of the present approach.

Keywords :

Lyapunov methods; adaptive control; approximation theory; closed loop systems; continuous time systems; neurocontrollers; nonlinear control systems; optimal control; robust control; uncertain systems; Hamilton-Jacobi-Bellman equation approximation; Lyapunov´s direct method; action-critic dual networks; closed loop system stability; control constraints; control policy; critic NN; identifier NN; identifier-critic architecture; infinite-horizon optimal control problem; neural network-based online adaptive optimal control; policy iteration; reinforcement learning; saturation constraints; uncertain nonlinear continuous-time systems; uncertain system dynamics;

fLanguage :

English

Journal_Title :

Control Theory & Applications, IET

Publisher :

iet

ISSN :

1751-8644

Type :

jour

DOI :

10.1049/iet-cta.2013.0472

Filename :

6668837

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1761588