Approximate real-time optimal control based on sparse Gaussian process models

Author

Boedecker, Joschka ; Springenberg, Jost Tobias ; Wulfing, Jan ; Riedmiller, Martin

Author_Institution

Dept. of Comput. Sci., Univ. of Freiburg, Freiburg, Germany

fYear

2014

fDate

9-12 Dec. 2014

Firstpage

1

Lastpage

8

Abstract

In this paper we present a fully automated approach to (approximate) optimal control of non-linear systems. Our algorithm jointly learns a non-parametric model of the system dynamics - based on Gaussian Process Regression (GPR) - and performs receding horizon control using an adapted iterative LQR formulation. This results in an extremely data-efficient learning algorithm that can operate under real-time constraints. When combined with an exploration strategy based on GPR variance, our algorithm successfully learns to control two benchmark problems in simulation (two-link manipulator, cart-pole) as well as to swing-up and balance a real cart-pole system. For all considered problems learning from scratch, that is without prior knowledge provided by an expert, succeeds in less than 10 episodes of interaction with the system.

Keywords

Gaussian processes; learning systems; linear quadratic control; manipulators; nonlinear dynamical systems; regression analysis; GPR variance; Gaussian process regression; approximate real-time optimal control; cart-pole system; data-efficient learning algorithm; iterative LQR formulation; nonlinear systems; receding horizon control; sparse Gaussian process models; system dynamics nonparametric model; two-link manipulator; Approximation algorithms; Approximation methods; Computational modeling; Optimal control; Optimization; Predictive models; Trajectory;

fLanguage

English

Publisher

ieee

Conference_Titel

Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on

Conference_Location

Orlando, FL

Type

conf

DOI

10.1109/ADPRL.2014.7010608

Filename

7010608