DocumentCode :
1327578
Title :
Robust speech recognition based on joint model and feature space optimization of hidden Markov models
Author :
Moon, Seokyong ; Hwang, Jenq-Neng
Author_Institution :
Commun. Syst. Res. & Dev., Samsung Electron., Seoul, South Korea
Volume :
8
Issue :
2
fYear :
1997
fDate :
3/1/1997 12:00:00 AM
Firstpage :
194
Lastpage :
204
Abstract :
The hidden Markov model (HMM) inversion algorithm, based on either the gradient search or the Baum-Welch reestimation of input speech features, is proposed and applied to the robust speech recognition tasks under general types of mismatch conditions. This algorithm stems from the gradient-based inversion algorithm of an artificial neural network (ANN) by viewing an HMM as a special type of ANN. Given input speech features s, the forward training of an HMM finds the model parameters λ subject to an optimization criterion. On the other hand, the inversion of an HMM finds speech features, s, subject to an optimization criterion with given model parameters λ. The gradient-based HMM inversion and the Baum-Welch HMM inversion algorithms can be successfully integrated with the model space optimization techniques, such as the robust MINIMAX technique, to compensate the mismatch in the joint model and feature space. The joint space mismatch compensation technique achieves better performance than the single space, i.e. either the model space or the feature space alone, mismatch compensation techniques. It is also demonstrated that approximately 10-dB signal-to-noise ratio (SNR) gain is obtained in the low SNR environments when the joint model and feature space mismatch compensation technique is used
Keywords :
compensation; hidden Markov models; maximum likelihood estimation; neural nets; search problems; speech recognition; Baum-Welch reestimation; forward training; gradient search; hidden Markov models; inversion algorithm; joint model/feature space optimization; joint space mismatch compensation technique; mismatch conditions; optimization criterion; robust MINIMAX technique; robust speech recognition; speech features; Artificial neural networks; Automatic speech recognition; Filtering; Hidden Markov models; Minimax techniques; Moon; Noise robustness; Speech enhancement; Speech recognition; Wiener filter;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/72.557656
Filename :
557656
Link To Document :
بازگشت