Model-free adaptive learning solutions for discrete-time dynamic graphical games

Author

Abouheaf, Mohammed I. ; Lewis, Frank L. ; Mahmoud, Magdi S.

Author_Institution

Syst. Eng., King Fahd Univ. of Pet. & Miner. (KFUPM), Dhahran, Saudi Arabia

fYear

2014

fDate

15-17 Dec. 2014

Firstpage

3578

Lastpage

3583

Abstract

This paper introduces novel model-free adaptive learning algorithm to solve the dynamic graphical games in real-time. It allows online model-free tuning of the controller and critic networks. This algorithm solves the dynamic graphical game in a distributed fashion. Novel coupled Bellman equations and Hamiltonian functions are developed for the dynamic graphical games. Nash solution for the dynamic graphical game is given in terms of the solution to a set of coupled Hamilton-Jacobi-Bellman equations developed herein. An online model-free policy iteration algorithm is developed to learn the Nash solution for the dynamic graphical game in real-time. A proof of convergence for this algorithm is given under mild assumptions about the inter-connectivity properties of the graph.

Keywords

convergence of numerical methods; discrete time systems; game theory; iterative methods; learning (artificial intelligence); mathematics computing; Hamiltonian functions; Nash solution; controller networks; convergence; coupled Hamilton-Jacobi-Bellman equations; critic networks; discrete-time dynamic graphical games; interconnectivity properties; model-free adaptive learning algorithm; online model-free policy iteration algorithm; online model-free tuning; Equations; Games; Heuristic algorithms; Integrated circuits; Mathematical model; Optimal control; Synchronization;

fLanguage

English

Publisher

ieee

Conference_Titel

Decision and Control (CDC), 2014 IEEE 53rd Annual Conference on

Conference_Location

Los Angeles, CA

Print_ISBN

978-1-4799-7746-8

Type

conf

DOI

10.1109/CDC.2014.7039945

Filename

7039945