• DocumentCode
    2242001
  • Title

    Associative reinforcement learning for discrete-time optimal control

  • Author

    Howell, M.N. ; Gordon, T.J.

  • Author_Institution
    Dept. of Aeronaut. & Autom. Eng., Loughborough Univ. of Technol., UK
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    42370
  • Lastpage
    42373
  • Abstract
    This paper investigates the application of associative reinforcement learning techniques to the optimal control of linear discrete-time dynamic systems. Associative reinforcement learning involves the trial and error interaction with a dynamic system to determine the control actions that optimally achieve some desired performance index. The methodology can be applied either online or off-line and in a model based or model free manner. Associative reinforcement learning techniques are applied to the optimal regulator (LQR) control of discrete-time linear systems. Adaptive critic designs are implemented and the convergence speed compared for the different approaches. These methods can determine the optimal state and state/action value function and the optimal policy without requiring system models
  • Keywords
    linear systems; LQR control; adaptive critic designs; associative reinforcement learning; convergence speed; discrete-time optimal control; dynamic system; linear discrete-time dynamic systems; optimal control; optimal regulator control; performance index; state/action value function; trial-and-error interaction;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Learning Systems for Control (Ref. No. 2000/069), IEE Seminar
  • Conference_Location
    Birmingham
  • Type

    conf

  • DOI
    10.1049/ic:20000342
  • Filename
    856946