DocumentCode :
869974
Title :
The STAR automaton: expediency and optimality properties
Author :
Economides, Anastasios A. ; Kehagias, Athanasios
Author_Institution :
Dept. of Econ., Univ. of Macedonia, Thessaloniki, Greece
Volume :
32
Issue :
6
fYear :
2002
fDate :
12/1/2002 12:00:00 AM
Firstpage :
723
Lastpage :
737
Abstract :
We present the STack ARchitecture (STAR) automaton. It is a fixed structure, multiaction, reward-penalty learning automaton, characterized by a star-shaped state transition diagram. Each branch of the star contains D states associated with a particular action. The branches are connected to a central "neutral" state. The most general version of STAR involves probabilistic state transitions in response to reward and/or penalty, but deterministic transitions can also be used. The learning behavior of STAR results from the stack-like operation of the branches; the learning parameter is D. By mathematical analysis, it is shown that STAR with deterministic reward/probabilistic penalty and a sufficiently large D can be rendered ε-optimal in every stationary environment. By numerical simulation it is shown that in nonstationary, switching environments, STAR usually outperforms classical variable structure automata such as LR-P, LR-I, and LR-εP.
Keywords :
learning automata; numerical analysis; STAR automaton; deterministic reward/probabilistic penalty; fixed structure multiaction reward-penalty learning automaton; learning parameter; nonstationary switching environments; numerical simulation; probabilistic state transitions; stack architecture automaton; stack-like operation; star-shaped state transition diagram; Decision making; Environmental economics; Learning automata; Mathematical analysis; Mathematics; Numerical simulation; Psychology; Pursuit algorithms; Stochastic processes; Time factors;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2002.1049607
Filename :
1049607
Link To Document :
بازگشت