مرکز منطقه ای اطلاع رساني علوم و فناوري - Toward effective combination of off-line and on-line training in ADP framework

DocumentCode :

2717636

Title :

Toward effective combination of off-line and on-line training in ADP framework

Author :

Prokhorov, Danil

Author_Institution :

Toyota Tech. Center, Ann Arbor, MI

fYear :

2007

fDate :

1-5 April 2007

Firstpage :

268

Lastpage :

271

Abstract :

We are interested in finding the most effective combination between off-line and on-line/real-time training in approximate dynamic programming. We introduce our approach of combining proven off-line methods of training for robustness with a group of on-line methods. Training for robustness is carried out on reasonably accurate models with the multi-stream Kalman filter method (Feldkamp et al., 1998), whereas on-line adaptation is performed either with the help of a critic or by methods resembling reinforcement learning. We also illustrate importance of using recurrent neural networks for both controller/actor and critic

Keywords :

Kalman filters; dynamic programming; learning (artificial intelligence); neurocontrollers; recurrent neural nets; robust control; approximate dynamic programming; multistream Kalman filter; offline training; online training; real-time training; recurrent neural network; reinforcement learning; robustness training; Adaptive control; Dynamic programming; Learning; Neural networks; Neurocontrollers; Programmable control; Recurrent neural networks; Robust control; Robustness; Uncertainty;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on

Conference_Location :

Honolulu, HI

Print_ISBN :

1-4244-0706-0

Type :

conf

DOI :

10.1109/ADPRL.2007.368198

Filename :

4220843

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2717636