• DocumentCode
    3576073
  • Title

    Approximate dynamic programming based on Gaussian process regression for the perimeter patrol optimization problem

  • Author

    Naiming Qi ; Xiaolei Sun ; Kang Sun ; Xingfu Liu ; Feng Wu ; Chao Liu

  • Author_Institution
    Sch. of Astronaut., Harbin Inst. of Technol., Harbin, China
  • fYear
    2014
  • Firstpage
    1750
  • Lastpage
    1754
  • Abstract
    A methodology is presented in this paper for stochastic optimal control of unmanned aerial vehicle performing the task of perimeter patrol. The optimal control problem is modeled as a Markov decision processes, and an approximate policy iteration algorithm is used for the cost-to-go function (value function) by introducing Gaussian process regression, resulting in improved quality of the decisions made while retaining computationally feasibility. The approximate dynamic programming (ADP) framework is developed to tackle the issues, in which situations standard dynamic programming algorithms become computationally too demanding. As a nonparametric ADP algorithm, the Gaussian processes that provide the combination of the prior and noise models presents a sub-solution in a lower dimensional space by exploiting kernel-based method. The numerical results that corroborate the effectiveness of the proposed methodology are also provided.
  • Keywords
    Gaussian processes; Markov processes; autonomous aerial vehicles; dynamic programming; iterative methods; optimal control; regression analysis; stochastic systems; ADP framework; Gaussian process regression; Markov decision process; approximate dynamic programming framework; approximate policy iteration algorithm; cost-to-go function; kernel-based method; lower dimensional space; nonparametric ADP algorithm; optimal control problem; perimeter patrol optimization problem; standard dynamic programming algorithm; stochastic optimal control; unmanned aerial vehicle; value function; Approximation algorithms; Approximation methods; Computational modeling; Dynamic programming; Gaussian processes; Ground penetrating radar; Heuristic algorithms; Gaussian process regression; Markov decision processes; approximate dynamic programming (ADP); perimeter patrol; unmanned air vehicle;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mechatronics and Control (ICMC), 2014 International Conference on
  • Print_ISBN
    978-1-4799-2537-7
  • Type

    conf

  • DOI
    10.1109/ICMC.2014.7231861
  • Filename
    7231861