DocumentCode :
1301146
Title :
Distributed learning of the global maximum in a two-player stochastic game with identical payoffs
Author :
Kumar, P. R Srikanta ; Young, Gia-kinh
Author_Institution :
Dept. of Electr., Comput. & Syst. Eng., Rensselaer Polytech. Inst., Troy, NY, USA
Issue :
6
fYear :
1985
Firstpage :
743
Lastpage :
753
Abstract :
Little is known about the distributed learning of the global maximum in a stochastic framework when there is no communication between the decisionmakers. The case of two decisionmakers is considered, and prior knowledge is assumed about the expected rewards. The asymmetries that may be present in the reward matrix is captured by the prior knowledge. It is shown that each decisionmaker completely unaware of the other converges to the global optimum with arbitrary accuracy over time.
Keywords :
Accuracy; Convergence; Cybernetics; Games; Learning automata; Steady-state; Stochastic processes;
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9472
Type :
jour
DOI :
10.1109/TSMC.1985.6313458
Filename :
6313458
Link To Document :
بازگشت