مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

50458

Title :

On Learning With Finite Memory

Author :

Drakopoulos, Kimon ; Ozdaglar, Asuman ; Tsitsiklis, John N.

Author_Institution :

Lab. of Inf. & Decision Syst., Massachusetts Inst. of Technol., Cambridge, MA, USA

Volume :

Issue :

fYear :

2013

fDate :

Oct. 2013

Firstpage :

6859

Lastpage :

6872

Abstract :

We consider an infinite collection of agents who make decisions, sequentially, about an unknown underlying binary state of the world. Each agent, prior to making a decision, receives an independent private signal whose distribution depends on the state of the world. Moreover, each agent also observes the decisions of its last K immediate predecessors. We study conditions under which the agent decisions converge to the correct value of the underlying state. We focus on the case where the private signals have bounded information content and investigate whether learning is possible, that is, whether there exist decision rules for the different agents that result in the convergence of their sequence of individual decisions to the correct state of the world. We first consider learning in the almost sure sense and show that it is impossible, for any value of K. We then explore the possibility of convergence in probability of the decisions to the correct state. Here, a distinction arises: if K=1, learning in probability is impossible under any decision rule, while for K ≥ 2, we design a decision rule that achieves it. We finally consider a new model, involving forward looking strategic agents, each of which maximizes the discounted sum (over all agents) of the probabilities of a correct decision. (The case, studied in the previous literature, of myopic agents who maximize the probability of their own decision being correct is an extreme special case.) We show that for any value of K, for any equilibrium of the associated Bayesian game, and under the assumption that each private signal has bounded information content, learning in probability fails to obtain.

Keywords :

learning (artificial intelligence); multi-agent systems; probability; Bayesian game; decision rule; distributed learning; finite memory; forward looking strategic agents; independent private signal; k immediate predecessors; myopic agents; probability; Bayes methods; Convergence; History; Indexes; Markov processes; Random variables; Testing; Decentralized; decision making; distributed estimation; distributed learning; estimation; estimation error; forward looking learning; learning; social learning;

fLanguage :

English

Journal_Title :

Information Theory, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9448

Type :

jour

DOI :

10.1109/TIT.2013.2262037

Filename :

6514539

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=50458