DocumentCode
3442983
Title
A new learning automaton for interaction with triple level environments
Author
Jamalian, A.H. ; Rezvani, R. ; Shams, H. ; Mehrabi, SH
Author_Institution
Sama Tech. & Vocational Training Coll., Islamic Azad Univ., Andisheh, Iran
fYear
2012
fDate
22-24 Aug. 2012
Firstpage
492
Lastpage
498
Abstract
Heretofore, the most presented Learning Automata (LA) is invented to interact with double level environments (one level for reward and the other for penalty). Those LA are often expedient, optimal or both of them and can minimize their mean value of receiving penalties (or at least converge to the minimum point) during time an d work much better than a pure-chance automaton. However, in many operational applications, the environment has three level responses; one for reward, one for small scale penalty and the last one for large scale penalty. In these applications, the old LA not only can not minimize the mean value of receiving penalties, but also in some cases their mean value of receiving penalties are even more than a pure-chance automaton. In this paper, first the triple level environments with illustrative example are described precisely. Then, the new fixed structure stochastic LA (called TILA) is introduced and its properties are considered mathematically. The simulation results show the old LA can not converge to the minimum value of the mean value of receiving penalties, but TILA receives fewer penalties in comparison with the older ones.
Keywords
learning (artificial intelligence); learning automata; fixed structure stochastic learning automata; interaction; operational application; reinforcement learning; reward; scale penalty; triple level environment; Automata; Equations; Learning; Learning automata; Performance evaluation; Stochastic processes; Learning Automata; Machine Learning; Reinforcement Learning; Triple Level Environments;
fLanguage
English
Publisher
ieee
Conference_Titel
Cognitive Informatics & Cognitive Computing (ICCI*CC), 2012 IEEE 11th International Conference on
Conference_Location
Kyoto
Print_ISBN
978-1-4673-2794-7
Type
conf
DOI
10.1109/ICCI-CC.2012.6311198
Filename
6311198
Link To Document