• DocumentCode
    21235
  • Title

    Heuristically-Accelerated Multiagent Reinforcement Learning

  • Author

    Bianchi, Reinaldo A. C. ; Martins, Murilo F. ; Ribeiro, Carlos H. C. ; Costa, Anna H. R.

  • Author_Institution
    Dept. of Electr. Eng., Centro Univ. da FEI, Sao Bernardo do Campo, Brazil
  • Volume
    44
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    252
  • Lastpage
    265
  • Abstract
    This paper presents a novel class of algorithms, called Heuristically-Accelerated Multiagent Reinforcement Learning (HAMRL), which allows the use of heuristics to speed up well-known multiagent reinforcement learning (RL) algorithms such as the Minimax-Q. Such HAMRL algorithms are characterized by a heuristic function, which suggests the selection of particular actions over others. This function represents an initial action selection policy, which can be handcrafted, extracted from previous experience in distinct domains, or learnt from observation. To validate the proposal, a thorough theoretical analysis proving the convergence of four algorithms from the HAMRL class (HAMMQ, HAMQ(λ), HAMQS, and HAMS) is presented. In addition, a comprehensive systematical evaluation was conducted in two distinct adversarial domains. The results show that even the most straightforward heuristics can produce virtually optimal action selection policies in much fewer episodes, significantly improving the performance of the HAMRL over vanilla RL algorithms.
  • Keywords
    learning (artificial intelligence); multi-agent systems; HAMRL algorithms; heuristic function; heuristically accelerated multiagent reinforcement learning; systematical evaluation; virtually optimal action selection; Artificial intelligence; heuristic algorithms; machine learning; multiagent systems;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2013.2253094
  • Filename
    6502216