مرکز منطقه ای اطلاع رساني علوم و فناوري - PRISCA: A policy search method for extreme trajectory following

DocumentCode :

3447639

Title :

PRISCA: A policy search method for extreme trajectory following

Author :

Lau, Tak Kit

Author_Institution :

Dept. of Mech. & Autom. Eng., Chinese Univ. of Hong Kong, Hong Kong, China

fYear :

2011

fDate :

12-15 Dec. 2011

Firstpage :

2856

Lastpage :

2862

Abstract :

Consider slide parking, given a desired demonstration, how to repeat it accurately? Many robotics tasks, such as slide parking, can be formulated in trajectory following, but not many dynamics of which can be easily modeled to facilitate a solving by the optimal control. Although an emerging stream in robotics is to learn the dynamics and policy from demonstrations, multiple, if not numerous, demonstrations are required. Therefore, learning a policy from scarce experience remains a difficult problem. In this paper, we proposed an online algorithm to learn a policy for control using only a desired demonstration and our intuitive knowledge of the dynamics system. Our approach is found on this observation: For trajectory following, even on a highly nonlinear and coupled dynamical system, so long as the state deviation is initially small, a policy can be updated online to keep the robot on track according to a very obvious and coarse model information (e.g., for driving, this information is simply: steer left to turn left). Our policy search is then devised as a function minimization problem, and is solved by gradient descent using the techniques of optimal baseline, least-state-deviation error, smoothing and in an inverse depreciation as a cost intensifier. Apart from guarantees on performance and convergence, we also demonstrated its performance in two simulations, and an extreme trajectory-following scenario - four-wheel-drive slide parking experiment. To our best knowledge, it is the state-of-the-art autonomous precision slide parking of a 4×4 brakeless RC car.

Keywords :

gradient methods; mobile robots; nonlinear dynamical systems; optimal control; robot dynamics; trajectory control; 4×4 brakeless RC car; PRISCA; autonomous precision slide parking; coarse model information; cost intensifier; coupled dynamical system; extreme trajectory following; four-wheel-drive slide parking experiment; function minimization problem; gradient descent; inverse depreciation; least-state-deviation error technique; nonlinear dynamical system; online algorithm; optimal baseline technique; optimal control; policy search method; robotics tasks; smoothing technique; Convergence; Cost function; Heuristic algorithms; Mathematical model; PD control; Robots; Trajectory;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on

Conference_Location :

Orlando, FL

ISSN :

0743-1546

Print_ISBN :

978-1-61284-800-6

Electronic_ISBN :

0743-1546

Type :

conf

DOI :

10.1109/CDC.2011.6161518

Filename :

6161518

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3447639