Title :
Reinforcement learning using back-propagation as a building block
Author :
Mills, Peter M. ; Zomaya, Albert Y.
Author_Institution :
CRA Adv. Tech. Dev., Cannington, WA, Australia
Abstract :
A novel unsupervised reinforcement learning rule is introduced, based on the use of the supervised backpropagation algorithm as a component building block. The learning rule is easy to understand and implement in software and builds on the accumulated experience of researchers using backpropagation. Unlike most reinforcement learning systems, the new rule can operate with either continuously valued or binary outputs. It is very tolerant with respect to a wide variety of performance measures and is unrestricted in range and variability. The technique should find application in most reinforcement learning situations but should have particular benefit for learning control systems
Keywords :
learning systems; neural nets; accumulated experience; learning control systems; performance measures; supervised backpropagation; unsupervised reinforcement learning rule; Adaptive control; Application software; Control system synthesis; Control systems; Learning; Network synthesis; Neural networks; Optimal control; Performance evaluation; Programmable control;
Conference_Titel :
Neural Networks, 1991. 1991 IEEE International Joint Conference on
Print_ISBN :
0-7803-0227-3
DOI :
10.1109/IJCNN.1991.170625