• DocumentCode
    1155591
  • Title

    Absorbing and Ergodic Discretized Two-Action Learning Automata

  • Author

    Oommen, B. John

  • Volume
    16
  • Issue
    2
  • fYear
    1986
  • fDate
    3/1/1986 12:00:00 AM
  • Firstpage
    282
  • Lastpage
    293
  • Abstract
    A learning automaton is a machine that interacts with a random environment and that simultaneously learns the optimal action that the environment offers to it. Learning automata with variable structure are considered. Such automata are completely defined by a set of probability updating rules. Contrary to all the variable-structure stochastic automata (VSSA) discussed in the literature, which update the probabilities in such a way that an action probability can take any real value in the interval [0,1], the probability space is discretized so as to permit the action probability to assume one of a finite number of distinct values in [0,1]. The discretized automaton is termed linear or nonlinear depending on whether the subintervals of [0,1] are of equal length. It is proven that 1) discretized two-action linear reward-inaction automata are absorbing and ¿-optimal in all environments; 2) discretized two-action linear inaction-penalty automata are ergodic and expedient in all environments; 3) discretized two-action linear inaction-penalty learning automata with artificially created absorbing barriers are ¿-optimal in all random environments; and 4) there exist nonlinear discretized reward-inaction automata that are ¿-optimal in all random environments. The maximum advantage gained by rendering any finite-state discretized automaton nonlinear has also been derived.
  • Keywords
    Automatic testing; Computer science; Councils; Cybernetics; Learning automata; Machine learning; Pattern recognition; Stochastic processes; System testing; Telephony;
  • fLanguage
    English
  • Journal_Title
    Systems, Man and Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9472
  • Type

    jour

  • DOI
    10.1109/TSMC.1986.4308951
  • Filename
    4308951