مرکز منطقه ای اطلاع رساني علوم و فناوري - The asymptotic optimality of discretized linear reward-inaction learning automata

DocumentCode :

1299975

Title :

The asymptotic optimality of discretized linear reward-inaction learning automata

Author :

Oommen, B. John ; Hansen, Erik

Author_Institution :

School of Computer Sci., Carleton Univ., Ottawa, Ont., Canada

Issue :

fYear :

1984

Firstpage :

542

Lastpage :

545

Abstract :

The automata considered have a variable structure and hence are completely described by action probability updating functions. The action probabilities can take only a finite number of prespecified values. These values linearly increase and the interval [0, 1] is divided into a number of equal length subintervals. The probability is updated by the automata only if the environment responds with a reward and hence they are called discretized linear reward-inaction automata. The asymptotic optimality of this family of automata is proved for all environments.

Keywords :

automata theory; learning systems; probability; variable structure systems; action probability updating functions; asymptotic optimality; discretized linear reward-inaction; environment responds; learning automata; variable structure; Accuracy; Automata; Convergence; Gold; Learning automata; Tin;

fLanguage :

English

Journal_Title :

Systems, Man and Cybernetics, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9472

Type :

jour

DOI :

10.1109/TSMC.1984.6313256

Filename :

6313256

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1299975