Title :
The asymptotic optimality of discretized linear reward-inaction learning automata
Author :
Oommen, B. John ; Hansen, Erik
Author_Institution :
School of Computer Sci., Carleton Univ., Ottawa, Ont., Canada
Abstract :
The automata considered have a variable structure and hence are completely described by action probability updating functions. The action probabilities can take only a finite number of prespecified values. These values linearly increase and the interval [0, 1] is divided into a number of equal length subintervals. The probability is updated by the automata only if the environment responds with a reward and hence they are called discretized linear reward-inaction automata. The asymptotic optimality of this family of automata is proved for all environments.
Keywords :
automata theory; learning systems; probability; variable structure systems; action probability updating functions; asymptotic optimality; discretized linear reward-inaction; environment responds; learning automata; variable structure; Accuracy; Automata; Convergence; Gold; Learning automata; Tin;
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMC.1984.6313256