Title :
Observations on the practical use of adaptive critics
Author :
Feldkamp, Lee A. ; Prokhorov, Danil V.
Author_Institution :
Res. Lab., Ford Motor Co., Dearborn, MI, USA
Abstract :
By studying adaptive critic designs (ACD) from the standpoint of practical use in training neural networks, we expect to establish the types of problems for which ACD might be preferable to more established methods. To restrict the scope, we have chosen to concentrate on applying ACD, specifically derivative critics, to the training of recurrent networks (L.A. FeldKamp et al., 1997). This is actually less restrictive than it may appear; many problems, including controller training, can be posed as optimizing some or all of the weights of a recurrent network. An immediate benefit of this focus has been to clarify the relationship between the derivatives that result from backpropagation through time (BPTT) and the quantities that derivative critics are expected to deliver. At the same time, many questions have been raised, such as that of the critic representation that best balances accuracy against the number of time steps required for adaptation. Because our formulation permits BPTT and derivative critics to be used together or separately, we expect that experience with a variety of problems will further clarify the various tradeoffs and suggest situations in which critics may be used to particular advantage
Keywords :
adaptive systems; backpropagation; recurrent neural nets; adaptive critic designs; backpropagation through time; controller training; critic representation; derivative critics; neural network training; recurrent networks; Backpropagation; Computational intelligence; Computer networks; Control systems; Cost function; Differential equations; Dynamic programming; Laboratories; Neural networks; Supervised learning;
Conference_Titel :
Systems, Man, and Cybernetics, 1997. Computational Cybernetics and Simulation., 1997 IEEE International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-4053-1
DOI :
10.1109/ICSMC.1997.633057