Reinforcement learning using back-propagation as a building block

Author

Mills, Peter M. ; Zomaya, Albert Y.

Author_Institution

CRA Adv. Tech. Dev., Cannington, WA, Australia

fYear

1991

fDate

18-21 Nov 1991

Firstpage

1554

Abstract

A novel unsupervised reinforcement learning rule is introduced, based on the use of the supervised backpropagation algorithm as a component building block. The learning rule is easy to understand and implement in software and builds on the accumulated experience of researchers using backpropagation. Unlike most reinforcement learning systems, the new rule can operate with either continuously valued or binary outputs. It is very tolerant with respect to a wide variety of performance measures and is unrestricted in range and variability. The technique should find application in most reinforcement learning situations but should have particular benefit for learning control systems

Keywords

learning systems; neural nets; accumulated experience; learning control systems; performance measures; supervised backpropagation; unsupervised reinforcement learning rule; Adaptive control; Application software; Control system synthesis; Control systems; Learning; Network synthesis; Neural networks; Optimal control; Performance evaluation; Programmable control;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks, 1991. 1991 IEEE International Joint Conference on

Print_ISBN

0-7803-0227-3

Type

conf

DOI

10.1109/IJCNN.1991.170625

Filename

170625

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2633693