مرکز منطقه ای اطلاع رساني علوم و فناوري - Approximate dynamic programming solutions of multi-agent graphical games using actor-critic network structures

DocumentCode :

671369

Title :

Approximate dynamic programming solutions of multi-agent graphical games using actor-critic network structures

Author :

Abouheaf, Mohammed I. ; Lewis, Frank L.

Author_Institution :

Res. Inst., Univ. of Texas at Arlington, Fort Worth, TX, USA

fYear :

2013

fDate :

4-9 Aug. 2013

Firstpage :

Lastpage :

Abstract :

This paper studies a new class of multi-agent discrete-time dynamical graphical games, where interactions between agents are restricted by a communication graph structure. The paper brings together discrete Hamiltonian mechanics, optimal control theory, cooperative control, game theory, reinforcement learning, and neural network structures to solve the multi-agent dynamical graphical games. Graphical game Bellman equations are derived and shown to be equivalent to certain graphical game Hamilton Jacobi Bellman equations developed herein. Reinforcement Learning techniques are used to solve these dynamical graphical games. Heuristic Dynamic Programming and Dual Heuristic Programming, are extended to solve the graphical games using only neighborhood information. Online adaptive learning structure is implemented using actor-critic networks to solve these graphical games.

Keywords :

discrete time systems; dynamic programming; game theory; graph theory; learning (artificial intelligence); multi-robot systems; optimal control; actor-critic network structures; actor-critic networks; approximate dynamic programming solutions; communication graph structure; cooperative control; discrete Hamiltonian mechanics; dual heuristic programming; game theory; graphical game Bellman equations; graphical game Hamilton Jacobi Bellman equations; heuristic dynamic programming; multiagent discrete-time dynamical graphical games; multiagent dynamical graphical games; multiagent graphical games; neighborhood information; neural network structures; online adaptive learning structure; optimal control theory; reinforcement learning techniques; Dynamic programming; Equations; Games; Heuristic algorithms; Jacobian matrices; Optimal control; Synchronization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), The 2013 International Joint Conference on

Conference_Location :

Dallas, TX

ISSN :

2161-4393

Print_ISBN :

978-1-4673-6128-6

Type :

conf

DOI :

10.1109/IJCNN.2013.6706708

Filename :

6706708

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=671369