DocumentCode
3576073
Title
Approximate dynamic programming based on Gaussian process regression for the perimeter patrol optimization problem
Author
Naiming Qi ; Xiaolei Sun ; Kang Sun ; Xingfu Liu ; Feng Wu ; Chao Liu
Author_Institution
Sch. of Astronaut., Harbin Inst. of Technol., Harbin, China
fYear
2014
Firstpage
1750
Lastpage
1754
Abstract
A methodology is presented in this paper for stochastic optimal control of unmanned aerial vehicle performing the task of perimeter patrol. The optimal control problem is modeled as a Markov decision processes, and an approximate policy iteration algorithm is used for the cost-to-go function (value function) by introducing Gaussian process regression, resulting in improved quality of the decisions made while retaining computationally feasibility. The approximate dynamic programming (ADP) framework is developed to tackle the issues, in which situations standard dynamic programming algorithms become computationally too demanding. As a nonparametric ADP algorithm, the Gaussian processes that provide the combination of the prior and noise models presents a sub-solution in a lower dimensional space by exploiting kernel-based method. The numerical results that corroborate the effectiveness of the proposed methodology are also provided.
Keywords
Gaussian processes; Markov processes; autonomous aerial vehicles; dynamic programming; iterative methods; optimal control; regression analysis; stochastic systems; ADP framework; Gaussian process regression; Markov decision process; approximate dynamic programming framework; approximate policy iteration algorithm; cost-to-go function; kernel-based method; lower dimensional space; nonparametric ADP algorithm; optimal control problem; perimeter patrol optimization problem; standard dynamic programming algorithm; stochastic optimal control; unmanned aerial vehicle; value function; Approximation algorithms; Approximation methods; Computational modeling; Dynamic programming; Gaussian processes; Ground penetrating radar; Heuristic algorithms; Gaussian process regression; Markov decision processes; approximate dynamic programming (ADP); perimeter patrol; unmanned air vehicle;
fLanguage
English
Publisher
ieee
Conference_Titel
Mechatronics and Control (ICMC), 2014 International Conference on
Print_ISBN
978-1-4799-2537-7
Type
conf
DOI
10.1109/ICMC.2014.7231861
Filename
7231861
Link To Document