مرکز منطقه ای اطلاع رساني علوم و فناوري - Decentralized learning in finite Markov chains

DocumentCode :

3072077

Title :

Decentralized learning in finite Markov chains

Author :

Wheeler, R.M. ; Narendra, K.S.

Author_Institution :

Yale University, New Haven, CT

fYear :

1985

fDate :

11-13 Dec. 1985

Firstpage :

1868

Lastpage :

1873

Abstract :

The principal contribution of this paper is a new result on the decentralized control of finite Markov chains with unknown transition probabilities and rewards. One decentralized decision maker is associated with each state in which two or more actions (decisions) are available. Each decision maker uses a simple learning scheme, requiring minimal information, to update its action choice. It is shown that, if updating is done in sufficiently small steps, the group will converge to the policy that maximizes the long-term expected reward per step. The analysis is based on learning in sequential stochastic games and on certain properties, derived in this paper, of ergodic Markov chains.

Keywords :

Control systems; Costs; Dynamic programming; State-space methods; Stochastic processes; Uncertainty;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 1985 24th IEEE Conference on

Conference_Location :

Fort Lauderdale, FL, USA

Type :

conf

DOI :

10.1109/CDC.1985.268906

Filename :

4048644

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3072077