مرکز منطقه ای اطلاع رساني علوم و فناوري - Training Recurrent Neurocontrollers for Real-Time Applications

DocumentCode :

1047636

Title :

Training Recurrent Neurocontrollers for Real-Time Applications

Author :

Prokhorov, Danil V.

Author_Institution :

Toyota Motor Eng. & Manufacturing North America, Ann Arbor

Volume :

Issue :

fYear :

2007

fDate :

7/1/2007 12:00:00 AM

Firstpage :

1003

Lastpage :

1015

Abstract :

In this paper, we introduce a new approach to train recurrent neurocontrollers for real-time applications. We begin with training a recurrent neurocontroller for robustness on high-fidelity models of physical systems. For training, we use a recently developed derivative-free Kalman filter method which we enhance for controller training. After training, we fix weights of our recurrent neurocontroller and deploy it in an embedded environment. Then, we carry out additional training of the neurocontroller by adapting in real time its internal state (short-term memory), rather than its weights (long-term memory). Such real-time training is done with a new combination of simultaneous perturbation stochastic approximation (SPSA) and adaptive critic. Our critic is also a recurrent neural network (RNN), and it is trained by stochastic meta-descent (SMD) for increased efficiency. Our approach is applied to two important practical problems, electronic throttle control and hybrid electric vehicle control, with apparent performance improvement.

Keywords :

Kalman filters; neurocontrollers; perturbation techniques; recurrent neural nets; robust control; stochastic processes; Kalman filter; adaptive critic; controller training; electronic throttle control; high-fidelity model; hybrid electric vehicle control; physical system; real-time application; recurrent neural network; recurrent neurocontrollers; robustness; simultaneous perturbation stochastic approximation; stochastic metadescent; Control systems; Neurocontrollers; Neurofeedback; Nonlinear dynamical systems; Output feedback; Recurrent neural networks; Robustness; Stochastic processes; Uncertainty; Vehicle dynamics; Neurocontroller; real-time adaptation; recurrent neural network (RNN); simultaneous perturbation stochastic approximation (SPSA); stochastic meta-descent (SMD); training for robustness; Algorithms; Computer Simulation; Computer Systems; Decision Support Techniques; Feedback; Models, Theoretical; Neural Networks (Computer);

fLanguage :

English

Journal_Title :

Neural Networks, IEEE Transactions on

Publisher :

ieee

ISSN :

1045-9227

Type :

jour

DOI :

10.1109/TNN.2007.899521

Filename :

4267717

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1047636