A new learning automaton for interaction with triple level environments

Author

Jamalian, A.H. ; Rezvani, R. ; Shams, H. ; Mehrabi, SH

Author_Institution

Sama Tech. & Vocational Training Coll., Islamic Azad Univ., Andisheh, Iran

fYear

2012

fDate

22-24 Aug. 2012

Firstpage

492

Lastpage

498

Abstract

Heretofore, the most presented Learning Automata (LA) is invented to interact with double level environments (one level for reward and the other for penalty). Those LA are often expedient, optimal or both of them and can minimize their mean value of receiving penalties (or at least converge to the minimum point) during time an d work much better than a pure-chance automaton. However, in many operational applications, the environment has three level responses; one for reward, one for small scale penalty and the last one for large scale penalty. In these applications, the old LA not only can not minimize the mean value of receiving penalties, but also in some cases their mean value of receiving penalties are even more than a pure-chance automaton. In this paper, first the triple level environments with illustrative example are described precisely. Then, the new fixed structure stochastic LA (called TILA) is introduced and its properties are considered mathematically. The simulation results show the old LA can not converge to the minimum value of the mean value of receiving penalties, but TILA receives fewer penalties in comparison with the older ones.

Keywords

learning (artificial intelligence); learning automata; fixed structure stochastic learning automata; interaction; operational application; reinforcement learning; reward; scale penalty; triple level environment; Automata; Equations; Learning; Learning automata; Performance evaluation; Stochastic processes; Learning Automata; Machine Learning; Reinforcement Learning; Triple Level Environments;

fLanguage

English

Publisher

ieee

Conference_Titel

Cognitive Informatics & Cognitive Computing (ICCI*CC), 2012 IEEE 11th International Conference on

Conference_Location

Kyoto

Print_ISBN

978-1-4673-2794-7

Type

conf

DOI

10.1109/ICCI-CC.2012.6311198

Filename

6311198