Zero-Sum Two-Player Game Theoretic Formulation of Affine Nonlinear Discrete-Time Systems Using Neural Networks

Author

Mehraeen, Shahab ; Dierks, Travis ; Jagannathan, Sarangapani ; Crow, Mariesa L.

Author_Institution

Dept. of Electr. & Comput. Eng., Louisiana State Univ., Baton Rouge, LA, USA

Volume

43

Issue

6

fYear

2013

fDate

Dec. 2013

Firstpage

1641

Lastpage

1655

Abstract

In this paper, the nearly optimal solution for discrete-time (DT) affine nonlinear control systems in the presence of partially unknown internal system dynamics and disturbances is considered. The approach is based on successive approximate solution of the Hamilton-Jacobi-Isaacs (HJI) equation, which appears in optimal control. Successive approximation approach for updating control and disturbance inputs for DT nonlinear affine systems are proposed. Moreover, sufficient conditions for the convergence of the approximate HJI solution to the saddle point are derived, and an iterative approach to approximate the HJI equation using a neural network (NN) is presented. Then, the requirement of full knowledge of the internal dynamics of the nonlinear DT system is relaxed by using a second NN online approximator. The result is a closed-loop optimal NN controller via offline learning. A numerical example is provided illustrating the effectiveness of the approach.

Keywords

approximation theory; closed loop systems; convergence of numerical methods; discrete time systems; game theory; iterative methods; learning (artificial intelligence); neurocontrollers; nonlinear control systems; nonlinear equations; optimal control; DT nonlinear affine system; HJI equation; Hamilton-Jacobi-Isaacs equation; closed loop optimal NN controller; convergence; discrete time system; iterative approach; neural network; offline learning; successive approximation approach; sufficient condition; unknown internal system dynamics; zero sum two player game theory; Approximation methods; Equations; Game theory; Games; Nonlinear systems; Optimal control; Taylor series; Hamilton–Jacobi–Isaacs (HJI); neural networks (NNs); nonlinear discrete-time (DT) systems; optimal control;

fLanguage

English

Journal_Title

Cybernetics, IEEE Transactions on

Publisher

ieee

ISSN

2168-2267

Type

jour

DOI

10.1109/TSMCB.2012.2227253

Filename

6392301