• DocumentCode
    671369
  • Title

    Approximate dynamic programming solutions of multi-agent graphical games using actor-critic network structures

  • Author

    Abouheaf, Mohammed I. ; Lewis, Frank L.

  • Author_Institution
    Res. Inst., Univ. of Texas at Arlington, Fort Worth, TX, USA
  • fYear
    2013
  • fDate
    4-9 Aug. 2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    This paper studies a new class of multi-agent discrete-time dynamical graphical games, where interactions between agents are restricted by a communication graph structure. The paper brings together discrete Hamiltonian mechanics, optimal control theory, cooperative control, game theory, reinforcement learning, and neural network structures to solve the multi-agent dynamical graphical games. Graphical game Bellman equations are derived and shown to be equivalent to certain graphical game Hamilton Jacobi Bellman equations developed herein. Reinforcement Learning techniques are used to solve these dynamical graphical games. Heuristic Dynamic Programming and Dual Heuristic Programming, are extended to solve the graphical games using only neighborhood information. Online adaptive learning structure is implemented using actor-critic networks to solve these graphical games.
  • Keywords
    discrete time systems; dynamic programming; game theory; graph theory; learning (artificial intelligence); multi-robot systems; optimal control; actor-critic network structures; actor-critic networks; approximate dynamic programming solutions; communication graph structure; cooperative control; discrete Hamiltonian mechanics; dual heuristic programming; game theory; graphical game Bellman equations; graphical game Hamilton Jacobi Bellman equations; heuristic dynamic programming; multiagent discrete-time dynamical graphical games; multiagent dynamical graphical games; multiagent graphical games; neighborhood information; neural network structures; online adaptive learning structure; optimal control theory; reinforcement learning techniques; Dynamic programming; Equations; Games; Heuristic algorithms; Jacobian matrices; Optimal control; Synchronization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2013 International Joint Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4673-6128-6
  • Type

    conf

  • DOI
    10.1109/IJCNN.2013.6706708
  • Filename
    6706708