DocumentCode :
3486774
Title :
Efficient robust policy optimization
Author :
Atkeson, Christopher G.
Author_Institution :
Robot. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2012
fDate :
27-29 June 2012
Firstpage :
5220
Lastpage :
5227
Abstract :
We provide efficient algorithms to calculate first and second order gradients of the cost of a control law with respect to its parameters, to speed up policy optimization. We achieve robustness by simultaneously designing one control law for multiple models with potentially different model structures, which represent model uncertainty and unmodeled dynamics. Providing explicit examples of possible unmodeled dynamics during the control design process is easier for the designer and is more effective than providing simulated perturbations to increase robustness, as is currently done in machine learning. Our approach supports the design of deterministic nonlinear and time varying controllers for both deterministic and stochastic nonlinear and time varying systems, including policies with internal state such as observers or other state estimators. We highlight the benefit of control laws made up of collections of simple policies where only one component policy is active at a time. Controller optimization and learning is particularly fast and effective in this situation because derivatives are decoupled.
Keywords :
control system synthesis; learning (artificial intelligence); nonlinear control systems; optimisation; state estimation; stochastic systems; control design process; control law design; controller optimization; deterministic nonlinear controllers; deterministic systems; efficient robust policy optimization; first order gradients; internal state; machine learning; model structures; model uncertainty; multiple models; observers; second order gradients; simulated perturbations; state estimators; stochastic nonlinear systems; time varying controllers; time varying systems; unmodeled dynamics; Computational modeling; Cost function; Equations; Mathematical model; Trajectory; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
American Control Conference (ACC), 2012
Conference_Location :
Montreal, QC
ISSN :
0743-1619
Print_ISBN :
978-1-4577-1095-7
Electronic_ISBN :
0743-1619
Type :
conf
DOI :
10.1109/ACC.2012.6315619
Filename :
6315619
Link To Document :
بازگشت