Title :
Toward effective combination of off-line and on-line training in ADP framework
Author :
Prokhorov, Danil
Author_Institution :
Toyota Tech. Center, Ann Arbor, MI
Abstract :
We are interested in finding the most effective combination between off-line and on-line/real-time training in approximate dynamic programming. We introduce our approach of combining proven off-line methods of training for robustness with a group of on-line methods. Training for robustness is carried out on reasonably accurate models with the multi-stream Kalman filter method (Feldkamp et al., 1998), whereas on-line adaptation is performed either with the help of a critic or by methods resembling reinforcement learning. We also illustrate importance of using recurrent neural networks for both controller/actor and critic
Keywords :
Kalman filters; dynamic programming; learning (artificial intelligence); neurocontrollers; recurrent neural nets; robust control; approximate dynamic programming; multistream Kalman filter; offline training; online training; real-time training; recurrent neural network; reinforcement learning; robustness training; Adaptive control; Dynamic programming; Learning; Neural networks; Neurocontrollers; Programmable control; Recurrent neural networks; Robust control; Robustness; Uncertainty;
Conference_Titel :
Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0706-0
DOI :
10.1109/ADPRL.2007.368198