• DocumentCode
    1395140
  • Title

    Dynamic Quantization and Power Allocation for Multisensor Estimation of Hidden Markov Models

  • Author

    Ghasemi, Nader ; Dey, Subhrakanti

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Univ. of Melbourne, Melbourne, VIC, Australia
  • Volume
    57
  • Issue
    7
  • fYear
    2012
  • fDate
    7/1/2012 12:00:00 AM
  • Firstpage
    1641
  • Lastpage
    1656
  • Abstract
    This paper investigates an optimal quantizer design problem for multisensor estimation of hidden Markov models (HMMs) whose description depends on unknown parameters. The sensor measurements are simply binary quantized and transmitted to a remote fusion center over noisy flat fading wireless channels under an average sum transmit power constraint. The objective is to determine a set of optimal quantization thresholds and sensor transmit powers, called an optimal policy, which minimizes the long run average of a weighted combination of the expected state estimation error and sum transmit power. We analyze the problem by formulating an adaptive Markov decision process (MDP) problem. In this framework, adaptive optimal control policies are obtained using a nonstationary value iteration (NVI) scheme and are termed NVI-adaptive policies. These NVI-adaptive policies are adapted to the HMM parameter estimates obtained via a strongly consistent maximum likelihood estimator. In particular, HMM parameter estimation is performed by a recursive expectation-maximization (EM) algorithm which computes estimates of the HMM parameters by maximizing a relative entropy information measure using the received quantized observations and the trajectory of the MDP. Under some regularity assumptions on the observation probability distributions and a geometric ergodicity condition on an extended Markov chain, the maximum-likelihood estimator is shown to be strongly consistent. It is shown that the NVI-adaptive policy based on this sequence of strongly consistent HMM parameter estimates is (asymptotically, under appropriate assumptions) average-optimal. Essentially, it minimizes the long run average cost of the weighted combination of the expected state estimation error and sum transmit power across the sensors for the HMM with true parameters in a time-asymptotic sense.
    The advantage of this scheme is that the policies are obtained recursively without the need to solve the Bellman equation at each time step, which can be computationally prohibitive. As is usual with value iteration schemes, practical implementation of the NVI-adaptive policy requires discretization of the state and action space, which results in some loss of optimality. Nevertheless, numerical results illustrate the asymptotic convergence properties of the parameter estimates and the asymptotically close-to-optimal performance of the adaptive MDP algorithm compared to the performance of an MDP-based dynamic quantization and power allocation algorithm designed with perfect knowledge of the true parameters.
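    As a rough illustration of the binary quantization and state estimation step mentioned in the abstract, the sketch below runs one HMM filter update from a single quantized bit. It is not the authors' algorithm: it assumes one sensor, Gaussian measurement noise, known model parameters, and an error-free channel (the paper treats noisy fading channels and unknown parameters). The names `quantize`, `filter_step`, `levels`, and `noise_std` are hypothetical.

```python
import math


def gaussian_cdf(x, mean, std):
    """Cumulative distribution function of a Gaussian, via math.erf."""
    return 0.5 * (1.0 + math.erf((x - mean) / (std * math.sqrt(2.0))))


def quantize(measurement, threshold):
    """Binary quantizer: 1 if the measurement exceeds the threshold, else 0."""
    return 1 if measurement > threshold else 0


def filter_step(belief, transition, levels, noise_std, threshold, bit):
    """One HMM filter update from a single received bit.

    Assumptions (not from the source): the sensor observes
    levels[state] + Gaussian noise, and the bit arrives without channel errors.
    """
    n = len(belief)
    # Prediction: propagate the belief through the Markov chain.
    predicted = [
        sum(belief[i] * transition[i][j] for i in range(n)) for j in range(n)
    ]
    # Correction: weight each state by the probability of the received bit.
    unnormalized = []
    for j in range(n):
        p_zero = gaussian_cdf(threshold, levels[j], noise_std)
        likelihood = p_zero if bit == 0 else 1.0 - p_zero
        unnormalized.append(predicted[j] * likelihood)
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


# Two-state example with a symmetric chain and threshold at zero.
transition = [[0.9, 0.1], [0.1, 0.9]]
levels = [-1.0, 1.0]
bit = quantize(0.7, threshold=0.0)
posterior = filter_step([0.5, 0.5], transition, levels, 1.0, 0.0, bit)
```

    In this toy setup the threshold is the only design variable. Moving it changes how informative each bit is about the hidden state, which is the quantity a dynamic quantization policy would adapt over time.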
  • Keywords
    fading channels; hidden Markov models; iterative methods; maximum likelihood estimation; parameter estimation; quantisation (signal); sensor fusion; state estimation; wireless sensor networks; Bellman equation; HMM parameter; NVI-adaptive policy; adaptive Markov decision process; adaptive optimal control policy; average sum transmit power constraint; dynamic quantization; extended Markov chain; geometric ergodicity condition; hidden Markov model; multisensor estimation; noisy flat fading wireless channels; nonstationary value iteration scheme; observation probability distributions; optimal policy; optimal quantization threshold; optimal quantizer design problem; parameter estimation; power allocation; power allocation algorithm; recursive expectation-maximization algorithm; remote fusion center; sensor measurement; state estimation error; strongly consistent maximum likelihood estimator; wireless sensor network; Hidden Markov models; Markov processes; Parameter estimation; Quantization; State estimation; Vectors; Hidden Markov models (HMMs); maximum-likelihood (ML) estimation; quantization; state estimation; wireless sensor networks;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/TAC.2011.2179420
  • Filename
    6099562