DocumentCode :
671369
Title :
Approximate dynamic programming solutions of multi-agent graphical games using actor-critic network structures
Author :
Abouheaf, Mohammed I. ; Lewis, Frank L.
Author_Institution :
Res. Inst., Univ. of Texas at Arlington, Fort Worth, TX, USA
fYear :
2013
fDate :
4-9 Aug. 2013
Firstpage :
1
Lastpage :
8
Abstract :
This paper studies a new class of multi-agent discrete-time dynamical graphical games, where interactions between agents are restricted by a communication graph structure. The paper brings together discrete Hamiltonian mechanics, optimal control theory, cooperative control, game theory, reinforcement learning, and neural network structures to solve the multi-agent dynamical graphical games. Graphical game Bellman equations are derived and shown to be equivalent to certain graphical game Hamilton Jacobi Bellman equations developed herein. Reinforcement Learning techniques are used to solve these dynamical graphical games. Heuristic Dynamic Programming and Dual Heuristic Programming, are extended to solve the graphical games using only neighborhood information. Online adaptive learning structure is implemented using actor-critic networks to solve these graphical games.
Keywords :
discrete time systems; dynamic programming; game theory; graph theory; learning (artificial intelligence); multi-robot systems; optimal control; actor-critic network structures; actor-critic networks; approximate dynamic programming solutions; communication graph structure; cooperative control; discrete Hamiltonian mechanics; dual heuristic programming; game theory; graphical game Bellman equations; graphical game Hamilton Jacobi Bellman equations; heuristic dynamic programming; multiagent discrete-time dynamical graphical games; multiagent dynamical graphical games; multiagent graphical games; neighborhood information; neural network structures; online adaptive learning structure; optimal control theory; reinforcement learning techniques; Dynamic programming; Equations; Games; Heuristic algorithms; Jacobian matrices; Optimal control; Synchronization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2013 International Joint Conference on
Conference_Location :
Dallas, TX
ISSN :
2161-4393
Print_ISBN :
978-1-4673-6128-6
Type :
conf
DOI :
10.1109/IJCNN.2013.6706708
Filename :
6706708
Link To Document :
بازگشت