• Title of article

    Approximate dynamic programming with a fuzzy parameterization

  • Author/Authors

    Bu?oniu، نويسنده , , Lucian and Ernst، نويسنده , , Damien and De Schutter، نويسنده , , Bart and Babu?ka، نويسنده , , Robert، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2010
  • Pages
    11
  • From page
    804
  • To page
    814
  • Abstract
    Dynamic programming (DP) is a powerful paradigm for general, nonlinear optimal control. Computing exact DP solutions is in general only possible when the process states and the control actions take values in a small discrete set. In practice, it is necessary to approximate the solutions. Therefore, we propose an algorithm for approximate DP that relies on a fuzzy partition of the state space, and on a discretization of the action space. This fuzzy Q-iteration algorithm works for deterministic processes, under the discounted return criterion. We prove that fuzzy Q-iteration asymptotically converges to a solution that lies within a bound of the optimal solution. A bound on the suboptimality of the solution obtained in a finite number of iterations is also derived. Under continuity assumptions on the dynamics and on the reward function, we show that fuzzy Q-iteration is consistent, i.e., that it asymptotically obtains the optimal solution as the approximation accuracy increases. These properties hold both when the parameters of the approximator are updated in a synchronous fashion, and when they are updated asynchronously. The asynchronous algorithm is proven to converge at least as fast as the synchronous one. The performance of fuzzy Q-iteration is illustrated in a two-link manipulator control problem.
  • Keywords
    Approximate Dynamic Programming , Fuzzy approximation , Convergence analysis , value iteration
  • Journal title
    Automatica
  • Serial Year
    2010
  • Journal title
    Automatica
  • Record number

    1448014