DocumentCode
2858340
Title
Common framework of certain reinforcement schedules
Author
Pacut, Andrzej
Author_Institution
Fac. of Electron. & Inf. Technol., Warsaw Univ. of Technol., Poland
Volume
3
fYear
1998
fDate
4-9 May 1998
Firstpage
2004
Abstract
We investigate reinforcement algorithms in the context of feedforward networks with gradient learning that use smoothed output gradient estimators. The reduced network is introduced to avoid output redundancy. The adaptive critic element can be viewed as a network with smoothed output gradients, and the associative search elements as the reduced network with smoothed output gradients. In this context, the adaptive critic element becomes a regular member of the family of adaptive critic designs.
Keywords
adaptive control; adaptive systems; discrete time systems; feedforward neural nets; learning (artificial intelligence); neurocontrollers; adaptive critic element; associative search elements; feedforward networks; gradient learning; reduced network; reinforcement schedules; smoothed output gradient estimators; Adaptive systems; Control system synthesis; Control systems; Dynamic programming; Equations; Information resources; Information technology; Learning; Neural networks; Optimal control;
fLanguage
English
Publisher
IEEE
Conference_Titel
Neural Networks Proceedings, 1998. IEEE World Congress on Computational Intelligence. The 1998 IEEE International Joint Conference on
Conference_Location
Anchorage, AK
ISSN
1098-7576
Print_ISBN
0-7803-4859-1
Type
conf
DOI
10.1109/IJCNN.1998.687167
Filename
687167