• DocumentCode
    2342624
  • Title

    Autonomous blimp control using model-free reinforcement learning in a continuous state and action space

  • Author

    Rottmann, Axel ; Plagemann, Christian ; Hilgers, Peter ; Burgard, Wolfram

  • Author_Institution
    Univ. of Freiburg, Freiburg
  • fYear
    2007
  • fDate
    Oct. 29 2007-Nov. 2 2007
  • Firstpage
    1895
  • Lastpage
    1900
  • Abstract
    In this paper, we present an approach that applies the reinforcement learning principle to the problem of learning height control policies for aerial blimps. In contrast to previous approaches, our method does not require sophisticated hand- tuned models, but rather learns the policy online, which makes the system easily adaptable to changing conditions. The blimp we apply our approach to is a small-scale vehicle equipped with an ultrasound sensor that measures its elevation relative to the ground. The major problem in the context of learning control policies lies in the high-dimensional state-action space that needs to be explored in order to identify the values of all state-action pairs. In this paper, we propose a solution to learning continuous control policies based on the Gaussian process model. In practical experiments carried out on a real robot we demonstrate that the system is able to learn a policy online within a few minutes only.
  • Keywords
    Gaussian processes; aerospace control; aerospace robotics; learning (artificial intelligence); mobile robots; space vehicles; Gaussian process model; action space; aerial blimps; autonomous blimp control; continuous state; model-free reinforcement learning; ultrasound sensor; Gaussian processes; Intelligent robots; Land vehicles; Learning; Notice of Violation; Road vehicles; State estimation; USA Councils; Ultrasonic imaging; Ultrasonic variables measurement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2007. IROS 2007. IEEE/RSJ International Conference on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-1-4244-0912-9
  • Electronic_ISBN
    978-1-4244-0912-9
  • Type

    conf

  • DOI
    10.1109/IROS.2007.4399531
  • Filename
    4399531