Title :
Weighted Likelihood Ratio (WLR) Hidden Markov Model for Noisy Speech Recognition
Author :
Huang, Chao ; Huang, Yingchun ; Soong, Frank ; Zhou, Jianlai
Author_Institution :
Microsoft Res. Asia, Beijing
Abstract :
In this paper we present a weighted likelihood ratio (WLR) based hidden Markov model and apply it to speech recognition in noise. The WLR measure emphasizes spectral peaks than valleys in comparing two given speech spectra. The measure is more consistent with human perception of speech formants where natural resonances of vocal track are and tends to be more robust to broad-band noise interferences than other measures. A complete HMM framework of this measure is derived and a mixture of exponential kernels is used to model the output probability density function. The new WLR-HMM is tested on the Aurora2 connected digits database in noise. It shows more robust performance than the MFCC trained GMM baseline system. When combined with the dynamic cepstral features, the multiple-stream WLR-HMM shows a 39% relative improvement over the baseline system
Keywords :
hidden Markov models; interference (signal); speech recognition; Aurora2 connected digits database; broad-band noise interferences; dynamic cepstral features; exponential kernels; hidden Markov model; noisy speech recognition; output probability density function; spectral peaks; weighted likelihood ratio; Density measurement; Hidden Markov models; Humans; Interference; Noise measurement; Noise robustness; Resonance; Signal to noise ratio; Speech enhancement; Speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1659951