DocumentCode :
109477
Title :
Neural Network-Based Finite-Horizon Optimal Control of Uncertain Affine Nonlinear Discrete-Time Systems
Author :
Qiming Zhao ; Hao Xu ; Jagannathan, Sarangapani
Author_Institution :
Dept. of Electr. & Comput. Eng., Missouri Univ. of Sci. & Technol., Rolla, MO, USA
Volume :
26
Issue :
3
fYear :
2015
fDate :
Mar-15
Firstpage :
486
Lastpage :
499
Abstract :
In this paper, the finite-horizon optimal control design for nonlinear discrete-time systems in affine form is presented. In contrast with the traditional approximate dynamic programming methodology, which requires at least partial knowledge of the system dynamics, in this paper, the complete system dynamics are relaxed utilizing a neural network (NN)-based identifier to learn the control coefficient matrix. The identifier is then used together with the actor-critic-based scheme to learn the time-varying solution, referred to as the value function, of the Hamilton-Jacobi-Bellman (HJB) equation in an online and forward-in-time manner. Since the solution of HJB is time-varying, NNs with constant weights and time-varying activation functions are considered. To properly satisfy the terminal constraint, an additional error term is incorporated in the novel update law such that the terminal constraint error is also minimized over time. Policy and/or value iterations are not needed and the NN weights are updated once a sampling instant. The uniform ultimate boundedness of the closed-loop system is verified by standard Lyapunov stability theory under nonautonomous analysis. Numerical examples are provided to illustrate the effectiveness of the proposed method.
Keywords :
Lyapunov methods; closed loop systems; control system synthesis; discrete time systems; dynamic programming; matrix algebra; neurocontrollers; nonlinear control systems; optimal control; sampling methods; stability; uncertain systems; HJB; Hamilton-Jacobi-Bellman equation; NN; actor-critic-based scheme; approximate dynamic programming methodology; closed-loop system; control coefficient matrix; finite-horizon optimal control design; neural network-based finite-horizon optimal control; neural network-based identifier; nonautonomous analysis; sampling instant; standard Lyapunov stability theory; terminal constraint error; time-varying activation functions; time-varying solution; uncertain affine nonlinear discrete-time systems; uniform ultimate boundedness; value function; Approximation methods; Artificial neural networks; Discrete-time systems; Equations; Nonlinear dynamical systems; Optimal control; Finite-horizon; Hamilton-Jacobi-Bellman (HJB) equation; Hamilton???Jacobi???Bellman (HJB) equation; neural network (NN); optimal control; optimal control.;
fLanguage :
English
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
2162-237X
Type :
jour
DOI :
10.1109/TNNLS.2014.2315646
Filename :
6811227
Link To Document :
بازگشت