DocumentCode
1799300
Title
Approximate real-time optimal control based on sparse Gaussian process models
Author
Boedecker, Joschka ; Springenberg, Jost Tobias ; Wulfing, Jan ; Riedmiller, Martin
Author_Institution
Dept. of Comput. Sci., Univ. of Freiburg, Freiburg, Germany
fYear
2014
fDate
9-12 Dec. 2014
Firstpage
1
Lastpage
8
Abstract
In this paper we present a fully automated approach to (approximate) optimal control of non-linear systems. Our algorithm jointly learns a non-parametric model of the system dynamics - based on Gaussian Process Regression (GPR) - and performs receding horizon control using an adapted iterative LQR formulation. This results in an extremely data-efficient learning algorithm that can operate under real-time constraints. When combined with an exploration strategy based on GPR variance, our algorithm successfully learns to control two benchmark problems in simulation (two-link manipulator, cart-pole) as well as to swing-up and balance a real cart-pole system. For all considered problems learning from scratch, that is without prior knowledge provided by an expert, succeeds in less than 10 episodes of interaction with the system.
Keywords
Gaussian processes; learning systems; linear quadratic control; manipulators; nonlinear dynamical systems; regression analysis; GPR variance; Gaussian process regression; approximate real-time optimal control; cart-pole system; data-efficient learning algorithm; iterative LQR formulation; nonlinear systems; receding horizon control; sparse Gaussian process models; system dynamics nonparametric model; two-link manipulator; Approximation algorithms; Approximation methods; Computational modeling; Optimal control; Optimization; Predictive models; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location
Orlando, FL
Type
conf
DOI
10.1109/ADPRL.2014.7010608
Filename
7010608
Link To Document