Title :
Smoothed spectral subtraction for a frequency-weighted HMM in noisy speech recognition
Author :
Matsumoto, Hiroshi ; Naitoh, Noboru
Author_Institution :
Dept. of Electr. & Electron. Eng., Shinshu Univ., Nagano, Japan
Abstract :
The paper proposes improved methods of smoothed spectral subtraction to enhance the recognition performance of a frequency weighted HMM (HMM-FW) in very noisy environments. The conventional spectral subtraction tends to produce discontinuity in estimated power spectra. This distortion is undesirable for HMM-FW which uses group delay spectra as feature vectors. In order to remove this distortion, the paper proposes two frequency smoothing methods in log spectral domain: (1) a low pass filtering by DCT; and (2) a weighted minimum mean square error method (WMSE) which fits cosine series to an estimated log power spectrum. The results show that the smoothers are very effective under very noisy conditions, especially for the frequency weighted HMM. The WMSE method combined with HMM-FW achieves the highest recognition accuracies, for instance, improving recognition rate from 68% to 88% at -6 dB SNR of car noise
Keywords :
acoustic noise; hidden Markov models; least mean squares methods; speech enhancement; speech recognition; DCT; WMSE method; cosine series; estimated log power spectrum; estimated power spectra; feature vectors; frequency smoothing methods; frequency weighted HMM; group delay spectra; log spectral domain; low pass filtering; noisy conditions; noisy environments; noisy speech recognition; recognition accuracies; recognition performance; recognition rate; smoothed spectral subtraction; weighted minimum mean square error method; Delay; Discrete cosine transforms; Filtering; Frequency estimation; Hidden Markov models; Low pass filters; Mean square error methods; Signal to noise ratio; Smoothing methods; Working environment noise;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607748