DocumentCode
3347807
Title
A multimodal variational approach to learning and inference in switching state space models [speech processing application]
Author
Lee, Leo J. ; Attias, Hagai ; Deng, Li ; Fieguth, Paul
Author_Institution
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
Volume
5
fYear
2004
fDate
17-21 May 2004
Abstract
An important general model for discrete-time signal processing is the switching state space (SSS) model, which generalizes the hidden Markov model and the Gaussian state space model. Inference and parameter estimation in this model are known to be computationally intractable. This paper presents a powerful new approximation to the SSS model. The approximation is based on a variational technique that preserves the multimodal nature of the continuous state posterior distribution. Furthermore, by incorporating a windowing technique, the resulting EM algorithm has complexity that is just linear in the length of the time series. An alternative Viterbi decoding with frame-based likelihood is also presented which is crucial for the speech application that originally motivates this work. Our experiments focus on demonstrating the effectiveness of the algorithm by extensive simulations. A typical example in speech processing is also included to show the potential of this approach for practical applications.
Keywords
Gaussian processes; Viterbi decoding; discrete time systems; frame based representation; hidden Markov models; learning (artificial intelligence); model-based reasoning; parameter estimation; speech processing; state-space methods; time series; variational techniques; EM algorithm; Gaussian state space model; SSS model approximation; continuous state posterior distribution; discrete-time signal processing; frame-based likelihood Viterbi decoding; hidden Markov model; inference; learning; multimodal variational technique; parameter estimation; speech processing; switching state space models; time series; windowing technique; Design engineering; Hidden Markov models; Parameter estimation; Power system modeling; Signal processing; Signal processing algorithms; Speech processing; State-space methods; Systems engineering and theory; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1327158
Filename
1327158
Link To Document