• DocumentCode
    1536318
  • Title

    Control of Unknown Nonlinear Systems With Efficient Transient Performance Using Concurrent Exploitation and Exploration

  • Author

    Kosmatopoulos, E.B.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Democritus Univ. of Thrace, Thessaloniki, Greece
  • Volume
    21
  • Issue
    8
  • fYear
    2010
  • Firstpage
    1245
  • Lastpage
    1261
  • Abstract
    Learning mechanisms that operate in unknown environments should be able to efficiently deal with the problem of controlling unknown dynamical systems. Many approaches that deal with such a problem face the so-called exploitation-exploration dilemma where the controller has to sacrifice efficient performance for the sake of learning “better” control strategies than the ones already known: during the exploration period, poor or even unstable closed-loop system performance may be exhibited. In this paper, we show that, in the case where the control goal is to stabilize an unknown dynamical system by means of state feedback, exploitation and exploration can be concurrently performed without the need of sacrificing efficiency. This is made possible through an appropriate combination of recent results developed by the author in the areas of adaptive control and adaptive optimization and a new result on the convex construction of control Lyapunov functions for nonlinear systems. The resulting scheme guarantees arbitrarily good performance in the regions where the system is controllable. Theoretical analysis as well as simulation results on a particularly challenging control problem verify such a claim.
  • Keywords
    Lyapunov methods; adaptive control; closed loop systems; nonlinear control systems; optimisation; state feedback; Lyapunov function; adaptive control; adaptive optimization; closed-loop system; concurrent exploitation; concurrent exploration; nonlinear system; state feedback; transient performance; Adaptive control; Analytical models; Control systems; Learning systems; Lyapunov method; Nonlinear control systems; Nonlinear systems; Programmable control; State feedback; System performance; Control Lyapunov function (CLF); exploitation vs exploration; high-order neural networks (HONN); persistence of excitation (PE); sum-of-squares (SoS); Adaptation, Physiological; Algorithms; Animals; Artificial Intelligence; Feedback; Humans; Mathematical Computing; Neural Networks (Computer); Nonlinear Dynamics;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/TNN.2010.2050211
  • Filename
    5510184