DocumentCode
904209
Title
A fused hidden Markov model with application to bimodal speech processing
Author
Pan, Hao ; Levinson, Stephen E. ; Huang, Thomas S. ; Liang, Zhi-Pei
Author_Institution
Sharp Labs. of America Inc., Camas, WA, USA
Volume
52
Issue
3
fYear
2004
fDate
3/1/2004 12:00:00 AM
Firstpage
573
Lastpage
581
Abstract
This paper presents a novel fused hidden Markov model (fused HMM) for integrating tightly coupled time series, such as audio and visual features of speech. In this model, the time series are first modeled by two conventional HMMs separately. The resulting HMMs are then fused together using a probabilistic fusion model, which is optimal according to the maximum entropy principle and a maximum mutual information criterion. Simulations and bimodal speaker verification experiments show that the proposed model can significantly reduce the recognition errors in noiseless or noisy environments.
Keywords
hidden Markov models; maximum entropy methods; speaker recognition; speech processing; HMM; bimodal speaker verification; bimodal speech processing; coupled time series; fused hidden Markov model; information fusion; maximum entropy principle; maximum mutual information criterion; noisy environment; probabilistic fusion model; recognition errors reduction; speech audio features; speech visual features; Computer errors; Entropy; Hidden Markov models; Joining processes; Mutual information; Noise reduction; Signal processing; Signal processing algorithms; Speech processing; Working environment noise;
fLanguage
English
Journal_Title
Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
1053-587X
Type
jour
DOI
10.1109/TSP.2003.822353
Filename
1268351
Link To Document