DocumentCode :
1047636
Title :
Training Recurrent Neurocontrollers for Real-Time Applications
Author :
Prokhorov, Danil V.
Author_Institution :
Toyota Motor Eng. & Manufacturing North America, Ann Arbor
Volume :
18
Issue :
4
fYear :
2007
fDate :
7/1/2007 12:00:00 AM
Firstpage :
1003
Lastpage :
1015
Abstract :
In this paper, we introduce a new approach to train recurrent neurocontrollers for real-time applications. We begin with training a recurrent neurocontroller for robustness on high-fidelity models of physical systems. For training, we use a recently developed derivative-free Kalman filter method which we enhance for controller training. After training, we fix weights of our recurrent neurocontroller and deploy it in an embedded environment. Then, we carry out additional training of the neurocontroller by adapting in real time its internal state (short-term memory), rather than its weights (long-term memory). Such real-time training is done with a new combination of simultaneous perturbation stochastic approximation (SPSA) and adaptive critic. Our critic is also a recurrent neural network (RNN), and it is trained by stochastic meta-descent (SMD) for increased efficiency. Our approach is applied to two important practical problems, electronic throttle control and hybrid electric vehicle control, with apparent performance improvement.
Keywords :
Kalman filters; neurocontrollers; perturbation techniques; recurrent neural nets; robust control; stochastic processes; Kalman filter; adaptive critic; controller training; electronic throttle control; high-fidelity model; hybrid electric vehicle control; physical system; real-time application; recurrent neural network; recurrent neurocontrollers; robustness; simultaneous perturbation stochastic approximation; stochastic metadescent; Control systems; Neurocontrollers; Neurofeedback; Nonlinear dynamical systems; Output feedback; Recurrent neural networks; Robustness; Stochastic processes; Uncertainty; Vehicle dynamics; Neurocontroller; real-time adaptation; recurrent neural network (RNN); simultaneous perturbation stochastic approximation (SPSA); stochastic meta-descent (SMD); training for robustness; Algorithms; Computer Simulation; Computer Systems; Decision Support Techniques; Feedback; Models, Theoretical; Neural Networks (Computer);
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/TNN.2007.899521
Filename :
4267717
Link To Document :
بازگشت