DocumentCode :
1395140
Title :
Dynamic Quantization and Power Allocation for Multisensor Estimation of Hidden Markov Models
Author :
Ghasemi, Nader ; Dey, Subhrakanti
Author_Institution :
Dept. of Electr. & Electron. Eng., Univ. of Melbourne, Melbourne, VIC, Australia
Volume :
57
Issue :
7
fYear :
2012
fDate :
7/1/2012 12:00:00 AM
Firstpage :
1641
Lastpage :
1656
Abstract :
This paper investigates an optimal quantizer design problem for multisensor estimation of a hidden Markov model (HMMs) whose description depends on unknown parameters. The sensor measurements are simply binary quantized and transmitted to a remote fusion center over noisy flat fading wireless channels under an average sum transmit power constraint. The objective is to determine a set of optimal quantization thresholds and sensor transmit powers, called an optimal policy, which minimizes the long run average of a weighted combination of the expected state estimation error and sum transmit power. We analyze the problem by formulating an adaptive Markov decision process (MDP) problem. In this framework, adaptive optimal control policies are obtained using a nonstationary value iteration (NVI) scheme and are termed as NVI-adaptive policies. These NVI-adaptive policies are adapted to the HMM parameter estimates obtained via a strongly consistent maximum likelihood estimator. In particular, HMM parameter estimation is performed by a recursive expectation-maximization (EM) algorithm which computes estimates of the HMM parameters by maximizing a relative entropy information measure using the received quantized observations and the trajectory of the MDP. Under some regularity assumptions on the observation probability distributions and a geometric ergodicity condition on an extended Markov chain, the maximum-likelihood estimator is shown to be strongly consistent. It is shown that the NVI-adaptive policy based on this sequence of strongly consistent HMM parameter estimates is (asymptotically, under appropriate assumptions) average-optimal. Essentially, it minimizes the long run average cost of the weighted combination of the expected state estimation error and sum transmit power across the sensors for the HMM with true parameters in a time-asymptotic sense. The advantage of this scheme is that the policies are obtained recursively without the need to solve the Bellman equat- on at each time step, which can be computationally prohibitive. As is usual with value iteration schemes, practical implementation of the NVI-adaptive policy requires discretization of the state and action space, which results in some loss of optimality. Nevertheless, numerical results illustrate the asymptotic convergence properties of the parameter estimates and the asymptotically close to optimal performance of the adaptive MDP algorithm compared to the performance of an MDP based dynamic quantization and power allocation algorithm designed with perfect knowledge of the true parameters.
Keywords :
fading channels; hidden Markov models; iterative methods; maximum likelihood estimation; parameter estimation; quantisation (signal); sensor fusion; state estimation; wireless sensor networks; Bellman equation; HMM parameter; NVI-adaptive policy; adaptive Markov decision process; adaptive optimal control policy; average sum transmit power constraint; dynamic quantization; extended Markov chain; geometric ergodicity condition; hidden Markov model; multisensor estimation; noisy flat fading wireless channels; nonstationary value iteration scheme; observation probability distributions; optimal policy; optimal quantization threshold; optimal quantizer design problem; parameter estimation; power allocation; power allocation algorithm; recursive expectation-maximization algorithm; remote fusion center; sensor measurement; state estimation error; strongly consistent maximum likelihood estimator; wireless sensor network; Hidden Markov models; Markov processes; Parameter estimation; Quantization; State estimation; Vectors; Hidden Markov models (HMMs); maximum-likelihood (ML) estimation; quantization; state estimation; wireless sensor networks;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/TAC.2011.2179420
Filename :
6099562
Link To Document :
بازگشت