DocumentCode
2242001
Title
Associative reinforcement learning for discrete-time optimal control
Author
Howell, M.N. ; Gordon, T.J.
Author_Institution
Dept. of Aeronaut. & Autom. Eng., Loughborough Univ. of Technol., UK
fYear
2000
fDate
2000
Firstpage
42370
Lastpage
42373
Abstract
This paper investigates the application of associative reinforcement learning techniques to the optimal control of linear discrete-time dynamic systems. Associative reinforcement learning involves the trial and error interaction with a dynamic system to determine the control actions that optimally achieve some desired performance index. The methodology can be applied either online or off-line and in a model based or model free manner. Associative reinforcement learning techniques are applied to the optimal regulator (LQR) control of discrete-time linear systems. Adaptive critic designs are implemented and the convergence speed compared for the different approaches. These methods can determine the optimal state and state/action value function and the optimal policy without requiring system models
Keywords
linear systems; LQR control; adaptive critic designs; associative reinforcement learning; convergence speed; discrete-time optimal control; dynamic system; linear discrete-time dynamic systems; optimal control; optimal regulator control; performance index; state/action value function; trial-and-error interaction;
fLanguage
English
Publisher
iet
Conference_Titel
Learning Systems for Control (Ref. No. 2000/069), IEE Seminar
Conference_Location
Birmingham
Type
conf
DOI
10.1049/ic:20000342
Filename
856946
Link To Document