مرکز منطقه ای اطلاع رساني علوم و فناوري - The absolutely expedient nonlinear reinforcement schemes under the unknown multiteacher environment

DocumentCode :

1298675

Title :

The absolutely expedient nonlinear reinforcement schemes under the unknown multiteacher environment

Author :

Baba, N.

Author_Institution :

Faculty of Engng., Tokushima Univ., Tokushima City, Japan

Issue :

fYear :

1983

Firstpage :

100

Lastpage :

108

Abstract :

Learning behaviours of variable-structure stochastic automata under a multiteacher environment are considered. The concepts of absolute expediency and ε-optimality in a single-teacher environment are extended by the introduction of an average weighted reward and are redefined for a multiteacher environment. As an extended form of the absolutely expedient learning algorithm, a general class of nonlinear learning algorithm, called the GAE scheme, is proposed as a reinforcement scheme in a multiteacher environment. It is shown that the GAE scheme is absolutely expedient and ε-optimal in the general n-teacher environment. Learning behaviours of the GAE scheme in various multiteacher environments are simulated by computer and the results indicate the effectiveness of the GAE scheme.

Keywords :

learning systems; nonlinear systems; stochastic automata; GAE scheme; absolutely expedient nonlinear reinforcement schemes; average weighted reward; epsilon optimality; nonlinear learning algorithm; unknown multiteacher environment; variable-structure stochastic automata; Automata; Computers; Cybernetics; Image analysis; Image edge detection; Learning automata;

fLanguage :

English

Journal_Title :

Systems, Man and Cybernetics, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9472

Type :

jour

DOI :

10.1109/TSMC.1983.6313039

Filename :

6313039

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1298675