• DocumentCode
    1765718
  • Title

    Zero-Sum Two-Player Game Theoretic Formulation of Affine Nonlinear Discrete-Time Systems Using Neural Networks

  • Author

    Mehraeen, Shahab ; Dierks, Travis ; Jagannathan, Sarangapani ; Crow, Mariesa L.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Louisiana State Univ., Baton Rouge, LA, USA
  • Volume
    43
  • Issue
    6
  • fYear
    2013
  • fDate
    Dec. 2013
  • Firstpage
    1641
  • Lastpage
    1655
  • Abstract
    In this paper, the nearly optimal solution for discrete-time (DT) affine nonlinear control systems in the presence of partially unknown internal system dynamics and disturbances is considered. The approach is based on successive approximate solution of the Hamilton-Jacobi-Isaacs (HJI) equation, which appears in optimal control. Successive approximation approach for updating control and disturbance inputs for DT nonlinear affine systems are proposed. Moreover, sufficient conditions for the convergence of the approximate HJI solution to the saddle point are derived, and an iterative approach to approximate the HJI equation using a neural network (NN) is presented. Then, the requirement of full knowledge of the internal dynamics of the nonlinear DT system is relaxed by using a second NN online approximator. The result is a closed-loop optimal NN controller via offline learning. A numerical example is provided illustrating the effectiveness of the approach.
  • Keywords
    approximation theory; closed loop systems; convergence of numerical methods; discrete time systems; game theory; iterative methods; learning (artificial intelligence); neurocontrollers; nonlinear control systems; nonlinear equations; optimal control; DT nonlinear affine system; HJI equation; Hamilton-Jacobi-Isaacs equation; closed loop optimal NN controller; convergence; discrete time system; iterative approach; neural network; offline learning; successive approximation approach; sufficient condition; unknown internal system dynamics; zero sum two player game theory; Approximation methods; Equations; Game theory; Games; Nonlinear systems; Optimal control; Taylor series; Hamilton–Jacobi–Isaacs (HJI); neural networks (NNs); nonlinear discrete-time (DT) systems; optimal control;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TSMCB.2012.2227253
  • Filename
    6392301