مرکز منطقه ای اطلاع رساني علوم و فناوري - Temporal-difference learning for online reachability analysis

DocumentCode :

3693450

Title :

Temporal-difference learning for online reachability analysis

Author :

Anayo K. Akametalu;Claire J. Tomlin

Author_Institution :

Department of Electrical Engineering and Computer Sciences, University of California, Berkeley , 94720, USA

fYear :

2015

fDate :

7/1/2015 12:00:00 AM

Firstpage :

2508

Lastpage :

2513

Abstract :

Hamilton-Jacobi-Isaacs (HJI) reachability analysis has been employed to guarantee safety in a number of applications including robotics, air traffic control, and control of HVAC systems. The current standard for these methods can result in overly-conservative controllers that degrade system performance with respect to other objectives. There has been interest in incorporating online machine learning techniques to reduce the conservativeness of the controller. However, recent efforts have resulted in methods that are computationally inefficient and scale poorly with the dimension of the state space. We propose a novel online reachability update algorithm based on Temporal-Difference (TD) learning that is computationally more efficient. Our algorithm is demonstrated on a simulation of a quadrotor learning to track a trajectory in a confined space. Our method outperforms standard reachability-based controllers when it comes to other (non-safety) objectives.

Keywords :

"Safety","Reachability analysis","Games","Trajectory","Computational modeling","Mathematical model","Uncertainty"

Publisher :

ieee

Conference_Titel :

Control Conference (ECC), 2015 European

Type :

conf

DOI :

10.1109/ECC.2015.7330915

Filename :

7330915

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3693450