• DocumentCode
    1485546
  • Title

    Decentralized Indirect Methods for Learning Automata Games

  • Author

    Tilak, Omkar ; Martin, Rashad ; Mukhopadhyay, Saibal

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Indiana Univ.-Purdue Univ. Indianapolis, Indianapolis, IN, USA
  • Volume
    41
  • Issue
    5
  • fYear
    2011
  • Firstpage
    1213
  • Lastpage
    1223
  • Abstract
    We discuss the application of indirect learning methods in zero-sum and identical payoff learning automata games. We propose a novel decentralized version of the well-known pursuit learning algorithm. Such a decentralized algorithm has significant computational advantages over its centralized counterpart. The theoretical study of such a decentralized algorithm requires the analysis to be carried out in a nonstationary environment. We use a novel bootstrapping argument to prove the convergence of the algorithm. To our knowledge, this is the first time that such analysis has been carried out for zero-sum and identical payoff games. Extensive simulation studies are reported, which demonstrate the proposed algorithm´s fast and accurate convergence in a variety of game scenarios. We also introduce the framework of partial communication in the context of identical payoff games of learning automata. In such games, the automata may not communicate with each other or may communicate selectively. This comprehensive framework has the capability to model both centralized and decentralized games discussed in this paper.
  • Keywords
    game theory; learning (artificial intelligence); learning automata; statistical analysis; algorithm convergence; bootstrapping argument; centralized games; decentralized algorithm; decentralized games; identical payoff learning automata games; indirect learning method; pursuit learning algorithm; zero-sum learning automata games; Algorithm design and analysis; Convergence; Games; Learning automata; Stochastic processes; Decentralized learning algorithms; games of learning automata; learning automata; reinforcement learning; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Cybernetics; Game Theory;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2011.2118749
  • Filename
    5740991