Title :
Training neurocontrollers for robustness via nprKF
Author :
Prokhorov, Danil
Author_Institution :
Ford Res. & Adv. Eng., Dearborn, MI, USA
Abstract :
We are interested in training neurocontrollers for robustness on discrete-time models of physical systems. Our neurocontrollers are implemented as recurrent neural networks. A model of the system to be controlled is known to the extent of parameters and/or signal uncertainties. Parameter values are drawn from a known distribution. For each instance of the model with specified parameters, a neurocontroller is trained by evaluating sensitivities of the model outputs to perturbations of the neurocontroller weights and incrementally updating the weights. Our training process strives to minimize a quadratic cost function averaged over many different models. In the end this process yields a robust neurocontroller, which is ready for deployment with fixed weights. We employ a derivative-free Kalman filter algorithm proposed in Norgaard, N., et al. (2000) and extended in Feldkamp, L.A., et al. (2001, 2002) and to neural network training. Our training algorithm combines effectiveness of a second-order training method with universal applicability to both differentiable and nondifferentiable systems. Our approach is that of model reference control, and it is similar in this sense to the approach in Prokhorov, D.V., et al. (2001). We illustrate it with two examples.
Keywords :
Kalman filters; discrete time systems; learning (artificial intelligence); minimisation; model reference adaptive control systems; neurocontrollers; recurrent neural nets; robust control; sensitivity analysis; uncertain systems; derivative-free Kalman filter algorithm; differentiable systems; discrete-time models; incremental weight updating; model output sensitivity evaluation; model reference control; neural network training; neurocontroller training; neurocontroller weights perturbations; nondifferentiable systems; nprKF; parameter uncertainties; physical systems; quadratic cost function minimization; recurrent neural networks; robust neurocontroller; robustness; second-order training method; signal uncertainties; Automotive engineering; Calibration; Control system synthesis; Mathematical model; Neural networks; Neurocontrollers; Power system modeling; Recurrent neural networks; Robustness; Uncertainty;
Conference_Titel :
American Control Conference, 2005. Proceedings of the 2005
Print_ISBN :
0-7803-9098-9
Electronic_ISBN :
0743-1619
DOI :
10.1109/ACC.2005.1470150