DocumentCode
671369
Title
Approximate dynamic programming solutions of multi-agent graphical games using actor-critic network structures
Author
Abouheaf, Mohammed I. ; Lewis, Frank L.
Author_Institution
Res. Inst., Univ. of Texas at Arlington, Fort Worth, TX, USA
fYear
2013
fDate
4-9 Aug. 2013
Firstpage
1
Lastpage
8
Abstract
This paper studies a new class of multi-agent discrete-time dynamical graphical games, where interactions between agents are restricted by a communication graph structure. The paper brings together discrete Hamiltonian mechanics, optimal control theory, cooperative control, game theory, reinforcement learning, and neural network structures to solve the multi-agent dynamical graphical games. Graphical game Bellman equations are derived and shown to be equivalent to certain graphical game Hamilton Jacobi Bellman equations developed herein. Reinforcement Learning techniques are used to solve these dynamical graphical games. Heuristic Dynamic Programming and Dual Heuristic Programming, are extended to solve the graphical games using only neighborhood information. Online adaptive learning structure is implemented using actor-critic networks to solve these graphical games.
Keywords
discrete time systems; dynamic programming; game theory; graph theory; learning (artificial intelligence); multi-robot systems; optimal control; actor-critic network structures; actor-critic networks; approximate dynamic programming solutions; communication graph structure; cooperative control; discrete Hamiltonian mechanics; dual heuristic programming; game theory; graphical game Bellman equations; graphical game Hamilton Jacobi Bellman equations; heuristic dynamic programming; multiagent discrete-time dynamical graphical games; multiagent dynamical graphical games; multiagent graphical games; neighborhood information; neural network structures; online adaptive learning structure; optimal control theory; reinforcement learning techniques; Dynamic programming; Equations; Games; Heuristic algorithms; Jacobian matrices; Optimal control; Synchronization;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), The 2013 International Joint Conference on
Conference_Location
Dallas, TX
ISSN
2161-4393
Print_ISBN
978-1-4673-6128-6
Type
conf
DOI
10.1109/IJCNN.2013.6706708
Filename
6706708
Link To Document