• DocumentCode
    3347807
  • Title

    A multimodal variational approach to learning and inference in switching state space models [speech processing application]

  • Author

    Lee, Leo J. ; Attias, Hagai ; Deng, Li ; Fieguth, Paul

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
  • Volume
    5
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    An important general model for discrete-time signal processing is the switching state space (SSS) model, which generalizes the hidden Markov model and the Gaussian state space model. Inference and parameter estimation in this model are known to be computationally intractable. This paper presents a powerful new approximation to the SSS model. The approximation is based on a variational technique that preserves the multimodal nature of the continuous state posterior distribution. Furthermore, by incorporating a windowing technique, the resulting EM algorithm has complexity that is just linear in the length of the time series. An alternative Viterbi decoding with frame-based likelihood is also presented which is crucial for the speech application that originally motivates this work. Our experiments focus on demonstrating the effectiveness of the algorithm by extensive simulations. A typical example in speech processing is also included to show the potential of this approach for practical applications.
  • Keywords
    Gaussian processes; Viterbi decoding; discrete time systems; frame based representation; hidden Markov models; learning (artificial intelligence); model-based reasoning; parameter estimation; speech processing; state-space methods; time series; variational techniques; EM algorithm; Gaussian state space model; SSS model approximation; continuous state posterior distribution; discrete-time signal processing; frame-based likelihood Viterbi decoding; hidden Markov model; inference; learning; multimodal variational technique; parameter estimation; speech processing; switching state space models; time series; windowing technique; Design engineering; Hidden Markov models; Parameter estimation; Power system modeling; Signal processing; Signal processing algorithms; Speech processing; State-space methods; Systems engineering and theory; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1327158
  • Filename
    1327158