Title :
Primitive adaptive critics
Author :
Prokhorov, Danil V. ; Feldkamp, Lee A.
Author_Institution :
Dept. of Electr. Eng., Texas Tech. Univ., Lubbock, TX, USA
Abstract :
We propose a simple framework for critic-based training of recurrent neural networks and feedback controllers. We term the critics that are used primitive adaptive critics, since we represent them with the simplest possible architecture (bias weight only). We derive this framework from two main premises. The first of these is a natural similarity between a form of approximate dynamic programming, called dual heuristic programming, and backpropagation through time (BPTT), which are discussed. The second premise is our emphasis on a development of a truly online critic-based training procedure competitive in performance and computational cost to truncated BPTT. Three examples illustrate the main features of the framework proposed
Keywords :
backpropagation; duality (mathematics); dynamic programming; feedback; model reference adaptive control systems; neurocontrollers; real-time systems; recurrent neural nets; approximate dynamic programming; backpropagation through time; critic-based learning; dual heuristic programming; feedback controllers; model reference adaptive control; online learning; primitive adaptive critics; recurrent neural networks; Adaptive control; Backpropagation; Computational efficiency; Computational intelligence; Cost function; Dynamic programming; Equations; Function approximation; Laboratories; Recurrent neural networks;
Conference_Titel :
Neural Networks,1997., International Conference on
Conference_Location :
Houston, TX
Print_ISBN :
0-7803-4122-8
DOI :
10.1109/ICNN.1997.614396