Title :
Structure of optimal policies in active sensing
Author_Institution :
Dept. of Electr. Eng., McGill Univ., Montreal, QC, Canada
Abstract :
We consider the optimal design of a sensing system in which a sensor can choose how and when to communicate to an estimator. The optimal choice of transmission and estimation policies is made difficult by the fact that the sensor and the estimator may use their entire history of observations. Traditionally, Markov decision theory is used to analyze such multi-stage decision problems. But, Markov decision theory assumes a single decision maker-an assumption that is not satisfied in an active sensing system that has two decision makers with different information. In this paper, we use the approach of Nayyar et al (2011) to investigate the system as a dynamic team. Using a series of structural results, we show that the optimal policy is easy to implement. We also obtain a dynamic programming decomposition to find optimal sensing and estimation policies.
Keywords :
Markov processes; decision making; decision theory; dynamic programming; estimation theory; wireless sensor networks; Markov decision theory; active sensing system; decision making; dynamic programming; estimation policy; optimal design policy; Arrays; Decision theory; Dynamic programming; Estimation; Markov processes; Sensors; Yttrium; dynamic teams; estimation theory; sensor networks; stochastic control;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289108