Title :
Adaptive Critic Designs for Discrete-Time Zero-Sum Games With Application to
Control
Author :
Al-Tamimi, A. ; Abu-Khalaf, M. ; Lewis, Frank L.
Author_Institution :
Autom. & Robotics Res. Inst., Univ. of Texas, Fort Worth, TX
Abstract :
In this correspondence, adaptive critic approximate dynamic programming designs are derived to solve the discrete-time zero-sum game in which the state and action spaces are continuous. This results in a forward-in-time reinforcement learning algorithm that converges to the Nash equilibrium of the corresponding zero-sum game. The results in this correspondence can be thought of as a way to solve the Riccati equation of the well-known discrete-time Hinfin optimal control problem forward in time. Two schemes are presented, namely: 1) a heuristic dynamic programming and 2) a dual-heuristic dynamic programming, to solve for the value function and the costate of the game, respectively. An Hinfin autopilot design for an F-16 aircraft is presented to illustrate the results
Keywords :
Hinfin control; Riccati equations; discrete time systems; dynamic programming; game theory; learning (artificial intelligence); Hinfin optimal control problem; Nash equilibrium; Riccati equation; adaptive critic design; discrete-time zero-sum game; dynamic programming; forward-in-time reinforcement learning algorithm; Adaptive control; Aircraft; Control systems; Dynamic programming; Learning; Nash equilibrium; Neural networks; Optimal control; Programmable control; Riccati equations; $H_{infty}$ optimal control; Adaptive critics; approximate dynamic programming (ADP); policy iteration; zero-sum game; Artificial Intelligence; Computer Simulation; Game Theory; Models, Theoretical; Signal Processing, Computer-Assisted;
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMCB.2006.880135