• DocumentCode
    1799313
  • Title

    Multi-objective reinforcement learning for AUV thruster failure recovery

  • Author

    Ahmadzadeh, Seyed Reza ; Kormushev, Petar ; Caldwell, D.G.

  • Author_Institution
    Dept. of Adv. Robot., Ist. Italiano di Tecnol., Genoa, Italy
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    This paper investigates learning approaches for discovering fault-tolerant control policies to overcome thruster failures in Autonomous Underwater Vehicles (AUV). The proposed approach is a model-based direct policy search that learns on an on-board simulated model of the vehicle. When a fault is detected and isolated the model of the AUV is reconfigured according to the new condition. To discover a set of optimal solutions a multi-objective reinforcement learning approach is employed which can deal with multiple conflicting objectives. Each optimal solution can be used to generate a trajectory that is able to navigate the AUV towards a specified target while satisfying multiple objectives. The discovered policies are executed on the robot in a closed-loop using AUV´s state feedback. Unlike most existing methods which disregard the faulty thruster, our approach can also deal with partially broken thrusters to increase the persistent autonomy of the AUV. In addition, the proposed approach is applicable when the AUV either becomes under-actuated or remains redundant in the presence of a fault. We validate the proposed approach on the model of the Girona500 AUV.
  • Keywords
    autonomous underwater vehicles; closed loop systems; control engineering computing; fault diagnosis; learning (artificial intelligence); mobile robots; optimal control; state feedback; AUV state feedback; AUV thruster failure recovery; Girona500 AUV; autonomous underwater vehicles; closed-loop; conflicting objective; fault detection; fault-tolerant control policy; faulty thruster; model-based direct policy search; multiobjective reinforcement learning approach; on-board simulated model; optimal solution; Optimization; Sociology; Statistics; Trajectory; Vectors; Vehicle dynamics; Vehicles;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
  • Conference_Location
    Orlando, FL
  • Type

    conf

  • DOI
    10.1109/ADPRL.2014.7010621
  • Filename
    7010621