• DocumentCode
    3442983
  • Title

    A new learning automaton for interaction with triple level environments

  • Author

    Jamalian, A.H. ; Rezvani, R. ; Shams, H. ; Mehrabi, SH

  • Author_Institution
    Sama Tech. & Vocational Training Coll., Islamic Azad Univ., Andisheh, Iran
  • fYear
    2012
  • fDate
    22-24 Aug. 2012
  • Firstpage
    492
  • Lastpage
    498
  • Abstract
    Heretofore, the most presented Learning Automata (LA) is invented to interact with double level environments (one level for reward and the other for penalty). Those LA are often expedient, optimal or both of them and can minimize their mean value of receiving penalties (or at least converge to the minimum point) during time an d work much better than a pure-chance automaton. However, in many operational applications, the environment has three level responses; one for reward, one for small scale penalty and the last one for large scale penalty. In these applications, the old LA not only can not minimize the mean value of receiving penalties, but also in some cases their mean value of receiving penalties are even more than a pure-chance automaton. In this paper, first the triple level environments with illustrative example are described precisely. Then, the new fixed structure stochastic LA (called TILA) is introduced and its properties are considered mathematically. The simulation results show the old LA can not converge to the minimum value of the mean value of receiving penalties, but TILA receives fewer penalties in comparison with the older ones.
  • Keywords
    learning (artificial intelligence); learning automata; fixed structure stochastic learning automata; interaction; operational application; reinforcement learning; reward; scale penalty; triple level environment; Automata; Equations; Learning; Learning automata; Performance evaluation; Stochastic processes; Learning Automata; Machine Learning; Reinforcement Learning; Triple Level Environments;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cognitive Informatics & Cognitive Computing (ICCI*CC), 2012 IEEE 11th International Conference on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4673-2794-7
  • Type

    conf

  • DOI
    10.1109/ICCI-CC.2012.6311198
  • Filename
    6311198