DocumentCode :
1480451
Title :
ϵ-optimal discretized linear reward-penalty learning automata
Author :
Oommen, B.J. ; Christensen, J.P.R.
Author_Institution :
Sch. of Comput. Sci., Carleton Univ., Ottawa, Ont., Canada
Volume :
18
Issue :
3
fYear :
1988
Firstpage :
451
Lastpage :
458
Abstract :
Variable-structure stochastic automata (VSSA) are considered which interact with an environment and which dynamically learn the optimal action that the environment offers. Like all VSSA the automata are fully defined by a set of action-probability updating rules. However, to minimize the requirements on the random-number generator used to implement the VSSA, and to increase the speed of convergence of the automation, the case in which the probability-updating functions can assume only a finite number of values. These values discretize the probability space [0, 1] and hence they are called discretized learning automata. The discretized automata are linear because the subintervals of [0, 1] are of equal length. The authors prove the following results: (a) two-action discretized linear reward-penalty automata are ergodic and ε-optimal in all environments whose minimum penalty probability is less than 0.5; (b) there exist discretized two-action linear reward-penalty automata that are ergodic and ε-optimal in all random environments, and (c) discretized two-action linear reward-penalty automata with artificially created absorbing barriers are ε-optimal in all random environments
Keywords :
learning systems; probability; stochastic automata; automata theory; convergence; discretized learning automata; linear reward-penalty automata; probability-updating functions; variable structure, stochastic automata; Automatic testing; Convergence; Learning automata; Machine learning; Pattern recognition; Random number generation; Routing; Stochastic processes; System testing; Telephony;
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9472
Type :
jour
DOI :
10.1109/21.7494
Filename :
7494
Link To Document :
بازگشت