• DocumentCode
    2591553
  • Title

    Associative reinforcement learning of real-valued functions

  • Author

    Gullapalli, VijayKumar

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Massachusetts Univ., Amherst, MA, USA
  • fYear
    1991
  • fDate
    13-16 Oct 1991
  • Firstpage
    1453
  • Abstract
    The author describes an algorithm, called the stochastic real-valued (SRV) algorithm, that uses evaluative performance feedback to learn associative maps from input vectors to real-valued actions. This algorithm is based on the pioneering work of A.G. Barto and P. Anandan (1985), in synthesizing associative reinforcement learning (ARL) algorithms using techniques from pattern classification and automata theory. A strong convergence theorem is presented that implies a form of optimal performance under certain general conditions of the SRV algorithm on ARL tasks. Simulation results are presented to illustrate the convergence behavior of the algorithm under the conditions of the theorem. The robustness of the algorithm is also demonstrated by simulations in which some of the conditions of the theorem are violated
  • Keywords
    automata theory; convergence; learning systems; neural nets; pattern recognition; associative reinforcement learning; automata theory; evaluative performance feedback; neural nets; pattern classification; pattern recognition; stochastic real-valued algorithm; strong convergence theorem; Algorithm design and analysis; Convergence; Learning automata; Learning systems; Probability distribution; Robustness; Stochastic processes; Stochastic systems; Supervised learning; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 1991. 'Decision Aiding for Complex Systems, Conference Proceedings., 1991 IEEE International Conference on
  • Conference_Location
    Charlottesville, VA
  • Print_ISBN
    0-7803-0233-8
  • Type

    conf

  • DOI
    10.1109/ICSMC.1991.169893
  • Filename
    169893