مرکز منطقه ای اطلاع رساني علوم و فناوري - Optimal sensing via multi-armed bandit relaxations in mixed observability domains

DocumentCode :

716742

Title :

Optimal sensing via multi-armed bandit relaxations in mixed observability domains

Author :

Lauri, Mikko ; Ritala, Risto

Author_Institution :

Dept. of Autom. Sci. & Eng., Tampere Univ. of Technol., Tampere, Finland

fYear :

2015

fDate :

26-30 May 2015

Firstpage :

4807

Lastpage :

4812

Abstract :

Sequential decision making under uncertainty is studied in a mixed observability domain. The goal is to maximize the amount of information obtained on a partially observable stochastic process under constraints imposed by a fully observable internal state. An upper bound for the optimal value function is derived by relaxing constraints. We identify conditions under which the relaxed problem is a multi-armed bandit whose optimal policy is easily computable. The upper bound is applied to prune the search space in the original problem, and the effect on solution quality is assessed via simulation experiments. Empirical results show effective pruning of the search space in a target monitoring domain.

Keywords :

decision making; robots; sensors; stochastic processes; fully observable internal state; mixed observability domains; multiarmed bandit relaxations; optimal sensing; optimal value function; partially observable stochastic process; search space pruning; sequential decision making; target monitoring domain; upper bound; Indexes; Observability; Optimization; Robot sensing systems; Upper bound;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation (ICRA), 2015 IEEE International Conference on

Conference_Location :

Seattle, WA

Type :

conf

DOI :

10.1109/ICRA.2015.7139867

Filename :

7139867

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=716742