DocumentCode :
992939
Title :
Convergence of teams and hierarchies of learning automata in connectionist systems
Author :
Thathachar, M.A.L. ; Phansalkar, V.V.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
Volume :
25
Issue :
11
fYear :
1995
fDate :
11/1/1995
Firstpage :
1459
Lastpage :
1469
Abstract :
Learning algorithms for feedforward connectionist systems in a reinforcement learning environment are developed and analyzed in this paper. The connectionist system is made up of units that are groups of learning automata. The learning algorithm used is the L_R-I (linear reward-inaction) algorithm, and its asymptotic behavior is approximated by an ordinary differential equation (ODE) for low values of the learning parameter. This is done using weak convergence techniques. The reinforcement learning model is used to pose the goal of the system as a constrained optimization problem. It is shown that the ODE, and hence the algorithm, exhibits local convergence properties, converging to local solutions of the related optimization problem. A three-layer pattern recognition network is used as an example to show that the system behaves as predicted and that reasonable rates of convergence are obtained. Simulations also show that the algorithm is robust to noise.
Keywords :
differential equations; feedforward neural nets; finite automata; learning automata; multilayer perceptrons; pattern recognition; asymptotic behavior; connectionist systems; constrained optimization problem; feedforward connectionist system; learning algorithms; learning automata; ordinary differential equation; reinforcement learning environment; three layer pattern recognition network; weak convergence techniques; Algorithm design and analysis; Availability; Constraint optimization; Convergence; Differential equations; Learning automata; Noise robustness; Pattern recognition; Stochastic resonance; Working environment noise;
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
Publisher :
IEEE
ISSN :
0018-9472
Type :
jour
DOI :
10.1109/21.467711
Filename :
467711