Title :
Gaussian Processes and Reinforcement Learning for Identification and Control of an Autonomous Blimp
Author :
Ko, Jonathan ; Klein, Daniel J. ; Fox, Dieter ; Haehnel, Dirk
Author_Institution :
Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA
Abstract :
Blimps are a promising platform for aerial robotics and have been studied extensively for this purpose. Unlike other aerial vehicles, blimps are relatively safe and also possess the ability to loiter for long periods. These advantages, however, have been difficult to exploit because blimp dynamics are complex and inherently non-linear. The classical approach to system modeling represents the system as an ordinary differential equation (ODE) based on Newtonian principles. A more recent modeling approach is based on representing state transitions as a Gaussian process (GP). In this paper, we present a general technique for system identification that combines these two modeling approaches into a single formulation. This is done by training a Gaussian process on the residual between the non-linear model and ground truth training data. The result is a GP-enhanced model that provides an estimate of uncertainty in addition to giving better state predictions than either ODE or GP alone. We show how the GP-enhanced model can be used in conjunction with reinforcement learning to generate a blimp controller that is superior to those learned with ODE or GP models alone.
Keywords :
Gaussian processes; aerospace robotics; differential equations; identification; learning (artificial intelligence); mobile robots; nonlinear control systems; robot dynamics; Gaussian process; Newtonian principles; aerial robotics; aerial vehicles; autonomous blimp control; blimp dynamics; nonlinear dynamics; ordinary differential equation; reinforcement learning; system identification; uncertainty estimation; Differential equations; Gaussian processes; Learning; Modeling; Nonlinear dynamical systems; Remotely operated vehicles; Robots; System identification; Vehicle dynamics; Vehicle safety;
Conference_Titel :
Robotics and Automation, 2007 IEEE International Conference on
Conference_Location :
Roma
Print_ISBN :
1-4244-0601-3
Electronic_ISBN :
1050-4729
DOI :
10.1109/ROBOT.2007.363075