DocumentCode :
990272
Title :
Adaptive Critic Designs for Discrete-Time Zero-Sum Games With Application to H_{\\infty } Control
Author :
Al-Tamimi, A. ; Abu-Khalaf, M. ; Lewis, Frank L.
Author_Institution :
Autom. & Robotics Res. Inst., Univ. of Texas, Fort Worth, TX
Volume :
37
Issue :
1
fYear :
2007
Firstpage :
240
Lastpage :
247
Abstract :
In this correspondence, adaptive critic approximate dynamic programming designs are derived to solve the discrete-time zero-sum game in which the state and action spaces are continuous. This results in a forward-in-time reinforcement learning algorithm that converges to the Nash equilibrium of the corresponding zero-sum game. The results in this correspondence can be thought of as a way to solve the Riccati equation of the well-known discrete-time Hinfin optimal control problem forward in time. Two schemes are presented, namely: 1) a heuristic dynamic programming and 2) a dual-heuristic dynamic programming, to solve for the value function and the costate of the game, respectively. An Hinfin autopilot design for an F-16 aircraft is presented to illustrate the results
Keywords :
Hinfin control; Riccati equations; discrete time systems; dynamic programming; game theory; learning (artificial intelligence); Hinfin optimal control problem; Nash equilibrium; Riccati equation; adaptive critic design; discrete-time zero-sum game; dynamic programming; forward-in-time reinforcement learning algorithm; Adaptive control; Aircraft; Control systems; Dynamic programming; Learning; Nash equilibrium; Neural networks; Optimal control; Programmable control; Riccati equations; $H_{infty}$ optimal control; Adaptive critics; approximate dynamic programming (ADP); policy iteration; zero-sum game; Artificial Intelligence; Computer Simulation; Game Theory; Models, Theoretical; Signal Processing, Computer-Assisted;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2006.880135
Filename :
4067064
Link To Document :
بازگشت