• DocumentCode
    2633693
  • Title

    Reinforcement learning using back-propagation as a building block

  • Author

    Mills, Peter M. ; Zomaya, Albert Y.

  • Author_Institution
    CRA Adv. Tech. Dev., Cannington, WA, Australia
  • fYear
    1991
  • fDate
    18-21 Nov 1991
  • Firstpage
    1554
  • Abstract
    A novel unsupervised reinforcement learning rule is introduced, based on the use of the supervised backpropagation algorithm as a component building block. The learning rule is easy to understand and implement in software and builds on the accumulated experience of researchers using backpropagation. Unlike most reinforcement learning systems, the new rule can operate with either continuously valued or binary outputs. It is very tolerant with respect to a wide variety of performance measures and is unrestricted in range and variability. The technique should find application in most reinforcement learning situations but should have particular benefit for learning control systems
  • Keywords
    learning systems; neural nets; accumulated experience; learning control systems; performance measures; supervised backpropagation; unsupervised reinforcement learning rule; Adaptive control; Application software; Control system synthesis; Control systems; Learning; Network synthesis; Neural networks; Optimal control; Performance evaluation; Programmable control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1991. 1991 IEEE International Joint Conference on
  • Print_ISBN
    0-7803-0227-3
  • Type

    conf

  • DOI
    10.1109/IJCNN.1991.170625
  • Filename
    170625