Title :
Structured Threshold Policies for Dynamic Sensor Scheduling—A Partially Observed Markov Decision Process Approach
Author :
Krishnamurthy, Vikram ; Djonin, Dejan V.
Author_Institution :
British Columbia Univ., Vancouver
Abstract :
We consider the optimal sensor scheduling problem formulated as a partially observed Markov decision process (POMDP). Due to operational constraints, at each time instant, the scheduler can dynamically select one out of a finite number of sensors and record a noisy measurement of an underlying Markov chain. The aim is to compute the optimal measurement scheduling policy, so as to minimize a cost function comprising of estimation errors and measurement costs. The formulation results in a nonstandard POMDP that is nonlinear in the information state. We give sufficient conditions on the cost function, dynamics of the Markov chain and observation probabilities so that the optimal scheduling policy has a threshold structure with respect to a monotone likelihood ratio (MLR) ordering. As a result, the computational complexity of implementing the optimal scheduling policy is inexpensive. We then present stochastic approximation algorithms for estimating the best linear MLR order threshold policy.
Keywords :
Markov processes; computational complexity; decision theory; minimisation; probability; scheduling; sensors; Markov chain; POMDP approach; cost function minimization; dynamic sensor scheduling; estimation errors; linear MLR order threshold policy; measurement costs; monotone likelihood ratio; observation probabilities; operational constraints; optimal measurement scheduling policy; optimal sensor scheduling policy; partially observed Markov decision process; stochastic approximation algorithms; structured threshold policies; Computational complexity; Cost function; Dynamic scheduling; Estimation error; Optimal scheduling; Processor scheduling; Stochastic processes; Sufficient conditions; Time factors; Time measurement; Bayesian filtering; monotone likelihood ratio (MLR) ordering; partially observed Markov decision processes (POMDPs); sensor scheduling; stochastic approximation algorithms; stochastic dynamic programming; threshold policies;
Journal_Title :
Signal Processing, IEEE Transactions on
DOI :
10.1109/TSP.2007.897908