DocumentCode :
2626330
Title :
Oracular Partially Observable Markov Decision Processes: A Very Special Case
Author :
Armstrong-Crews, Nicholas ; Veloso, Manuela
Author_Institution :
Robotics Inst., Carnegie Mellon Univ., Pittsburgh, PA
fYear :
2007
fDate :
10-14 April 2007
Firstpage :
2477
Lastpage :
2482
Abstract :
We introduce the oracular partially observable Markov decision process (OPOMDP), a type of POMDP in which the world produces no observations; instead there is an "oracle," available in any state, that tells the agent its exact state for a fixed cost. The oracle may be a human or a highly accurate sensor. At each timestep the agent must choose whether to take a domain-level action or consult the oracle. This formulation comprises a factorization between information-gathering actions and domain-level actions, allowing us to characterize the value of information and to examine the problem of planning under uncertainty from a novel perspective. We propose an algorithm to capitalize on this factorization and the special structure of the OPOMDP, and we test the algorithm\´s performance on a new sample domain. On this new domain, we are able to solve a problem with hundreds of thousands of action-states and vastly outperform a previous state-of-the-art approximate technique
Keywords :
Markov processes; decision theory; observability; domain-level action; information-gathering action; oracular partially observable Markov decision process; Algorithm design and analysis; Costs; Humans; Observability; Robot sensing systems; Robotics and automation; Sensor phenomena and characterization; State-space methods; Testing; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation, 2007 IEEE International Conference on
Conference_Location :
Roma
ISSN :
1050-4729
Print_ISBN :
1-4244-0601-3
Electronic_ISBN :
1050-4729
Type :
conf
DOI :
10.1109/ROBOT.2007.363691
Filename :
4209455
Link To Document :
بازگشت