مرکز منطقه ای اطلاع رساني علوم و فناوري - Accelerating critic learning in approximate dynamic programming via value templates and perceptual learning

DocumentCode :

1929401

Title :

Accelerating critic learning in approximate dynamic programming via value templates and perceptual learning

Author :

Shannon, Thaddeus T. ; Santiago, Roberto A. ; George, G.L.

Volume :

fYear :

2003

fDate :

20-24 July 2003

Firstpage :

2922

Abstract :

The concept of value templates and perceptual learning are introduced as refinements to the reinforcement learning (RL) paradigm. We demonstrate a method for accelerating dual heuristic programming (DHP) critic training using value templates and perceptual learning. Both faster and more stable learning are achieved by using the value template and utilizing its inherent constraints to regularize the perceptual learning task. The method is demonstrated by tuning a neurofuzzy control system for a highly nonlinear 2^nd order plant proposed by Sanner and Slotine. We take advantage of the TSK model framework throughout to keep the controller, critic, and model components used in DHP highly interpretable.

Keywords :

dynamic programming; fuzzy neural nets; heuristic programming; learning (artificial intelligence); neurocontrollers; nonlinear control systems; TSK model framework; approximate dynamic programming; dual heuristic programming critic training; highly nonlinear 2nd order plant; neurofuzzy control system; perceptual learning; perceptual learning task; reinforcement learning paradigm; value templates; Acceleration; Central nervous system; Decision making; Dynamic programming; Fuzzy control; Hippocampus; Intelligent systems; Laboratories; Learning; Optimal control;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks, 2003. Proceedings of the International Joint Conference on

ISSN :

1098-7576

Print_ISBN :

0-7803-7898-9

Type :

conf

DOI :

10.1109/IJCNN.2003.1224035

Filename :

1224035

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1929401