DocumentCode
2758662
Title
Approximate policy improvement for continuous action set-point regulation problems
Author
Esogbue, A.O. ; Hearnes, W.E., II
Author_Institution
Sch. of Ind. & Syst. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Volume
2
fYear
1998
fDate
4-9 May 1998
Firstpage
1684
Abstract
Model-free control methods based on classical dynamic programming approximation algorithms are a fertile area of research. These methods are generally for discrete state and action spaces. This research proposes an algorithm for extending approximate policy iteration to continuous action spaces by combining the derivative-free line search methods of nonlinear optimization with policy improvement based on Q-learning using a discrete subset of reference actions. The properties of the proposed algorithm are discussed and two example problems illustrate its applicability. Q-learning algorithms are used to search a continuous action space. The method reduces the computational effort required in many problems
Keywords
dynamic programming; optimal control; search problems; Q-learning; approximate policy improvement; classical dynamic programming approximation algorithms; continuous action set-point regulation problems; derivative-free line search methods; model-free control methods; nonlinear optimization; Control systems; Dynamic programming; Intelligent control; Intelligent systems; Laboratories; Learning; Power system dynamics; Power system modeling; Process control; Space technology;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems Proceedings, 1998. IEEE World Congress on Computational Intelligence., The 1998 IEEE International Conference on
Conference_Location
Anchorage, AK
ISSN
1098-7584
Print_ISBN
0-7803-4863-X
Type
conf
DOI
10.1109/FUZZY.1998.686375
Filename
686375
Link To Document