DocumentCode :
250857
Title :
Monte Carlo methods for exact & efficient solution of the generalized optimality equations
Author :
Ortega, Pedro A. ; Braun, Daniel A. ; Tishby, Naftali
Author_Institution :
GRASP Robot. Lab., Univ. of Pennsylvania, Philadelphia, PA, USA
fYear :
2014
fDate :
May 31 2014-June 7 2014
Firstpage :
4322
Lastpage :
4327
Abstract :
Previous work has shown that classical sequential decision making rules, including expectimax and minimax, are limit cases of a more general class of bounded rational planning problems that trade off the value and the complexity of the solution, as measured by its information divergence from a given reference. This allows modeling a range of novel planning problems having varying degrees of control due to resource constraints, risk-sensitivity, trust and model uncertainty. However, so far it has been unclear in what sense information constraints relate to the complexity of planning. In this paper, we introduce Monte Carlo methods to solve the generalized optimality equations in an efficient & exact way when the inverse temperatures in a generalized decision tree are of the same sign. These methods highlight a fundamental relation between inverse temperatures and the number of Monte Carlo proposals. In particular, it is seen that the number of proposals is essentially independent of the size of the decision tree.
Keywords :
Monte Carlo methods; decision making; decision trees; optimisation; planning; Monte Carlo methods; Monte Carlo proposals; bounded rational planning problems; expectimax; generalized decision tree; generalized optimality equations; information constraints; information divergence; inverse temperatures; minimax; model uncertainty; planning complexity; resource constraints; risk-sensitivity; sequential decision making rules; trust; Decision trees; Equations; Monte Carlo methods; Planning; Proposals; Robots; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation (ICRA), 2014 IEEE International Conference on
Conference_Location :
Hong Kong
Type :
conf
DOI :
10.1109/ICRA.2014.6907488
Filename :
6907488
Link To Document :
بازگشت