DocumentCode :
716742
Title :
Optimal sensing via multi-armed bandit relaxations in mixed observability domains
Author :
Lauri, Mikko ; Ritala, Risto
Author_Institution :
Dept. of Autom. Sci. & Eng., Tampere Univ. of Technol., Tampere, Finland
fYear :
2015
fDate :
26-30 May 2015
Firstpage :
4807
Lastpage :
4812
Abstract :
Sequential decision making under uncertainty is studied in a mixed observability domain. The goal is to maximize the amount of information obtained on a partially observable stochastic process under constraints imposed by a fully observable internal state. An upper bound for the optimal value function is derived by relaxing constraints. We identify conditions under which the relaxed problem is a multi-armed bandit whose optimal policy is easily computable. The upper bound is applied to prune the search space in the original problem, and the effect on solution quality is assessed via simulation experiments. Empirical results show effective pruning of the search space in a target monitoring domain.
Keywords :
decision making; robots; sensors; stochastic processes; fully observable internal state; mixed observability domains; multiarmed bandit relaxations; optimal sensing; optimal value function; partially observable stochastic process; search space pruning; sequential decision making; target monitoring domain; upper bound; Indexes; Observability; Optimization; Robot sensing systems; Upper bound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation (ICRA), 2015 IEEE International Conference on
Conference_Location :
Seattle, WA
Type :
conf
DOI :
10.1109/ICRA.2015.7139867
Filename :
7139867
Link To Document :
بازگشت