Title :
Improving tractability of POMDPs by separation of decision and perceptual processes
Author :
Fakoor, Rasool ; Huber, Manfred
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA
Abstract :
Markov Decision Processes (MDPs) and Partially Observable Markov Decision Processes (POMDPs) are very powerful frameworks to model decision and decision learning tasks in a wide range of problem domains. Thus, they are used widely in complex and real-world situations such as robot control tasks. However, this modeling power and generality of the framework comes at a cost in that the complexity of the underlying model and corresponding algorithms grows dramatically as the complexity of the task domain increases. To address this issue in the context of tasks where raw sensory features are used as a basis for complex decision making, this paper presents an integrated and adaptive approach that attempts to reduce the complexity of the decision learning problem by separating the POMDP model into separate decision and perceptual processes. In the proposed framework, a sampling method is used for the perceptual process and reinforcement learning serves to address the decision process. Handling the perceptual and decision processes separately here promises the potential to make it easier to extract relevant perceptual information and concentrate the decision process on relevant state attributes. This, in turn, promises to allow the framework to scale to problems in which traditional POMDP methods are intractable. We show and discuss the effectiveness of our method analytically and empirically.
Keywords :
Markov processes; decision making; decision theory; learning (artificial intelligence); sampling methods; POMDP; adaptive approach; complex decision making; complexity reduction; decision learning problem; decision learning task; integrated approach; partially observable Markov decision processes; perceptual process; raw sensory feature; reinforcement learning; robot control task; sampling method; tractability improvement; Atmospheric measurements; Complexity theory; Equations; Particle measurements; Robot kinematics; Uncertainty; Complexity; Decision Process; POMDP; Perceptual Process;
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-1713-9
Electronic_ISBN :
978-1-4673-1712-2
DOI :
10.1109/ICSMC.2012.6377790