• DocumentCode
    1721416
  • Title

    Helicopter velocity tracking control by adaptive actor-critic reinforcement method

  • Author

    Hu, Yang ; Chen, Yang ; Han, Jianda ; Wang, Yuechao ; Qi, Juntong

  • Author_Institution
    State Key Lab. of Robot., Shenyang Inst. of Autom. (SIA), Shenyang, China
  • fYear
    2011
  • Firstpage
    1482
  • Lastpage
    1486
  • Abstract
    A robotic helicopter is an aircraft equipped with a sensing, computing, actuation, and communication infrastructure that allows it to execute a variety of tasks with autonomous mode. In this paper, we present an adaptive actor-critic reinforcement method to obtain near optimal controller for small autonomous helicopter. A network based on Q-value performs the critic and is trained by SARSA algorithm. A BP neural network, which is the actor network, generates control signal of helicopter dynamics. First, the proposed actor-critic reinforcement controller is introduced, then the algorithm is applied to an unmanned helicopter known as a highly nonlinear and complex system and the simulation results are presented.
  • Keywords
    autonomous aerial vehicles; backpropagation; large-scale systems; learning (artificial intelligence); neurocontrollers; nonlinear systems; optimal control; vehicle dynamics; velocity control; BP neural network; Q-value; SARSA algorithm; actor network; actor-critic reinforcement controller; actuation infrastructure; adaptive actor-critic reinforcement method; autonomous helicopter; autonomous mode; communication infrastructure; complex system; computing infrastructure; helicopter dynamics control signal; helicopter velocity tracking control; nonlinear system; optimal controller; robotic helicopter; sensing infrastructure; unmanned helicopter; Helicopters; Learning; Mathematical model; Robots; Training; Vectors; Vehicle dynamics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Biomimetics (ROBIO), 2011 IEEE International Conference on
  • Conference_Location
    Karon Beach, Phuket
  • Print_ISBN
    978-1-4577-2136-6
  • Type

    conf

  • DOI
    10.1109/ROBIO.2011.6181499
  • Filename
    6181499