Title :
Distributed learning of the global maximum in a two-player stochastic game with identical payoffs
Author :
Kumar, P. R Srikanta ; Young, Gia-kinh
Author_Institution :
Dept. of Electr., Comput. & Syst. Eng., Rensselaer Polytech. Inst., Troy, NY, USA
Abstract :
Little is known about the distributed learning of the global maximum in a stochastic framework when there is no communication between the decisionmakers. The case of two decisionmakers is considered, and prior knowledge is assumed about the expected rewards. The asymmetries that may be present in the reward matrix is captured by the prior knowledge. It is shown that each decisionmaker completely unaware of the other converges to the global optimum with arbitrary accuracy over time.
Keywords :
Accuracy; Convergence; Cybernetics; Games; Learning automata; Steady-state; Stochastic processes;
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMC.1985.6313458