مرکز منطقه ای اطلاع رساني علوم و فناوري - A new approach to the design of reinforcement schemes for learning automata: stochastic estimator learning algorithms

DocumentCode :

1122820

Title :

A new approach to the design of reinforcement schemes for learning automata: stochastic estimator learning algorithms

Author :

Papadimitriou, Georgios I.

Author_Institution :

Dept. of Comput. Eng., Patras Univ., Greece

Volume :

Issue :

fYear :

1994

fDate :

8/1/1994 12:00:00 AM

Firstpage :

649

Lastpage :

654

Abstract :

A new class of learning automata is introduced. The new automata use a stochastic estimator and are able to operate in nonstationary environments with high accuracy and a high adaptation rate. According to the stochastic estimator scheme, the estimates of the mean rewards of actions are computed stochastically. So, they are not strictly dependent on the environmental responses. The dependence between the stochastic estimates and the deterministic estimator´s contents is more relaxed when the latter are old and probably invalid. In this way, actions that have not been selected recently have the opportunity to be estimated as “optimal”, to increase their choice probability, and, consequently, to be selected. Thus, the estimator is always recently updated and consequently is able to be adapted to environmental changes. The performance of the Stochastic Estimator Learning Automaton (SELA) is superior to the previous well-known S-model ergodic schemes. Furthermore, it is proved that SELA is absolutely expedient in every stationary S-model random environment

Keywords :

finite automata; stochastic automata; unsupervised learning; S-model ergodic scheme; SELA; Stochastic Estimator Learning Automaton; absolute expediency; high adaptation; learning automata; nonstationary environments; reinforcement schemes; stochastic estimator; stochastic estimator learning algorithms; Algorithm design and analysis; Application software; Computer networks; Feedback; Learning automata; Machine learning; State estimation; Stochastic processes;

fLanguage :

English

Journal_Title :

Knowledge and Data Engineering, IEEE Transactions on

Publisher :

ieee

ISSN :

1041-4347

Type :

jour

DOI :

10.1109/69.298183

Filename :

298183

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1122820