Multi-objective reinforcement learning for AUV thruster failure recovery

Author

Ahmadzadeh, Seyed Reza ; Kormushev, Petar ; Caldwell, D.G.

Author_Institution

Dept. of Adv. Robot., Ist. Italiano di Tecnol., Genoa, Italy

fYear

2014

fDate

9-12 Dec. 2014

Firstpage

1

Lastpage

8

Abstract

This paper investigates learning approaches for discovering fault-tolerant control policies to overcome thruster failures in Autonomous Underwater Vehicles (AUV). The proposed approach is a model-based direct policy search that learns on an on-board simulated model of the vehicle. When a fault is detected and isolated the model of the AUV is reconfigured according to the new condition. To discover a set of optimal solutions a multi-objective reinforcement learning approach is employed which can deal with multiple conflicting objectives. Each optimal solution can be used to generate a trajectory that is able to navigate the AUV towards a specified target while satisfying multiple objectives. The discovered policies are executed on the robot in a closed-loop using AUV´s state feedback. Unlike most existing methods which disregard the faulty thruster, our approach can also deal with partially broken thrusters to increase the persistent autonomy of the AUV. In addition, the proposed approach is applicable when the AUV either becomes under-actuated or remains redundant in the presence of a fault. We validate the proposed approach on the model of the Girona500 AUV.

Keywords

autonomous underwater vehicles; closed loop systems; control engineering computing; fault diagnosis; learning (artificial intelligence); mobile robots; optimal control; state feedback; AUV state feedback; AUV thruster failure recovery; Girona500 AUV; autonomous underwater vehicles; closed-loop; conflicting objective; fault detection; fault-tolerant control policy; faulty thruster; model-based direct policy search; multiobjective reinforcement learning approach; on-board simulated model; optimal solution; Optimization; Sociology; Statistics; Trajectory; Vectors; Vehicle dynamics; Vehicles;

fLanguage

English

Publisher

ieee

Conference_Titel

Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on

Conference_Location

Orlando, FL

Type

conf

DOI

10.1109/ADPRL.2014.7010621

Filename

7010621