DocumentCode :
1299975
Title :
The asymptotic optimality of discretized linear reward-inaction learning automata
Author :
Oommen, B. John ; Hansen, Erik
Author_Institution :
School of Computer Sci., Carleton Univ., Ottawa, Ont., Canada
Issue :
3
fYear :
1984
Firstpage :
542
Lastpage :
545
Abstract :
The automata considered have a variable structure and hence are completely described by action probability updating functions. The action probabilities can take only a finite number of prespecified values. These values linearly increase and the interval [0, 1] is divided into a number of equal length subintervals. The probability is updated by the automata only if the environment responds with a reward and hence they are called discretized linear reward-inaction automata. The asymptotic optimality of this family of automata is proved for all environments.
Keywords :
automata theory; learning systems; probability; variable structure systems; action probability updating functions; asymptotic optimality; discretized linear reward-inaction; environment responds; learning automata; variable structure; Accuracy; Automata; Convergence; Gold; Learning automata; Tin;
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9472
Type :
jour
DOI :
10.1109/TSMC.1984.6313256
Filename :
6313256
Link To Document :
بازگشت