Approximate dynamic programming based on Gaussian process regression for the perimeter patrol optimization problem

Author

Naiming Qi ; Xiaolei Sun ; Kang Sun ; Xingfu Liu ; Feng Wu ; Chao Liu

Author_Institution

Sch. of Astronaut., Harbin Inst. of Technol., Harbin, China

fYear

2014

Firstpage

1750

Lastpage

1754

Abstract

A methodology is presented in this paper for stochastic optimal control of unmanned aerial vehicle performing the task of perimeter patrol. The optimal control problem is modeled as a Markov decision processes, and an approximate policy iteration algorithm is used for the cost-to-go function (value function) by introducing Gaussian process regression, resulting in improved quality of the decisions made while retaining computationally feasibility. The approximate dynamic programming (ADP) framework is developed to tackle the issues, in which situations standard dynamic programming algorithms become computationally too demanding. As a nonparametric ADP algorithm, the Gaussian processes that provide the combination of the prior and noise models presents a sub-solution in a lower dimensional space by exploiting kernel-based method. The numerical results that corroborate the effectiveness of the proposed methodology are also provided.

Keywords

Gaussian processes; Markov processes; autonomous aerial vehicles; dynamic programming; iterative methods; optimal control; regression analysis; stochastic systems; ADP framework; Gaussian process regression; Markov decision process; approximate dynamic programming framework; approximate policy iteration algorithm; cost-to-go function; kernel-based method; lower dimensional space; nonparametric ADP algorithm; optimal control problem; perimeter patrol optimization problem; standard dynamic programming algorithm; stochastic optimal control; unmanned aerial vehicle; value function; Approximation algorithms; Approximation methods; Computational modeling; Dynamic programming; Gaussian processes; Ground penetrating radar; Heuristic algorithms; Gaussian process regression; Markov decision processes; approximate dynamic programming (ADP); perimeter patrol; unmanned air vehicle;

fLanguage

English

Publisher

ieee

Conference_Titel

Mechatronics and Control (ICMC), 2014 International Conference on

Print_ISBN

978-1-4799-2537-7

Type

conf

DOI

10.1109/ICMC.2014.7231861

Filename

7231861