DocumentCode
2633693
Title
Reinforcement learning using back-propagation as a building block
Author
Mills, Peter M. ; Zomaya, Albert Y.
Author_Institution
CRA Adv. Tech. Dev., Cannington, WA, Australia
fYear
1991
fDate
18-21 Nov 1991
Firstpage
1554
Abstract
A novel unsupervised reinforcement learning rule is introduced, based on the use of the supervised backpropagation algorithm as a component building block. The learning rule is easy to understand and implement in software and builds on the accumulated experience of researchers using backpropagation. Unlike most reinforcement learning systems, the new rule can operate with either continuously valued or binary outputs. It is very tolerant with respect to a wide variety of performance measures and is unrestricted in range and variability. The technique should find application in most reinforcement learning situations but should have particular benefit for learning control systems
Keywords
learning systems; neural nets; accumulated experience; learning control systems; performance measures; supervised backpropagation; unsupervised reinforcement learning rule; Adaptive control; Application software; Control system synthesis; Control systems; Learning; Network synthesis; Neural networks; Optimal control; Performance evaluation; Programmable control;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 1991. 1991 IEEE International Joint Conference on
Print_ISBN
0-7803-0227-3
Type
conf
DOI
10.1109/IJCNN.1991.170625
Filename
170625
Link To Document