• DocumentCode
    990272
  • Title

    Adaptive Critic Designs for Discrete-Time Zero-Sum Games With Application to H_{\\infty } Control

  • Author

    Al-Tamimi, A. ; Abu-Khalaf, M. ; Lewis, Frank L.

  • Author_Institution
    Autom. & Robotics Res. Inst., Univ. of Texas, Fort Worth, TX
  • Volume
    37
  • Issue
    1
  • fYear
    2007
  • Firstpage
    240
  • Lastpage
    247
  • Abstract
    In this correspondence, adaptive critic approximate dynamic programming designs are derived to solve the discrete-time zero-sum game in which the state and action spaces are continuous. This results in a forward-in-time reinforcement learning algorithm that converges to the Nash equilibrium of the corresponding zero-sum game. The results in this correspondence can be thought of as a way to solve the Riccati equation of the well-known discrete-time Hinfin optimal control problem forward in time. Two schemes are presented, namely: 1) a heuristic dynamic programming and 2) a dual-heuristic dynamic programming, to solve for the value function and the costate of the game, respectively. An Hinfin autopilot design for an F-16 aircraft is presented to illustrate the results
  • Keywords
    Hinfin control; Riccati equations; discrete time systems; dynamic programming; game theory; learning (artificial intelligence); Hinfin optimal control problem; Nash equilibrium; Riccati equation; adaptive critic design; discrete-time zero-sum game; dynamic programming; forward-in-time reinforcement learning algorithm; Adaptive control; Aircraft; Control systems; Dynamic programming; Learning; Nash equilibrium; Neural networks; Optimal control; Programmable control; Riccati equations; $H_{infty}$ optimal control; Adaptive critics; approximate dynamic programming (ADP); policy iteration; zero-sum game; Artificial Intelligence; Computer Simulation; Game Theory; Models, Theoretical; Signal Processing, Computer-Assisted;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2006.880135
  • Filename
    4067064