Title :
HMM-based strategies for enhancement of speech signals embedded in nonstationary noise
Author :
Sameti, Hossein ; Sheikhzadeh, Hamid ; Deng, Li ; Brennan, Robert L.
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
fDate :
9/1/1998 12:00:00 AM
Abstract :
An improved hidden Markov model-based (HMM-based) speech enhancement system designed using the minimum mean square error principle is implemented and compared with a conventional spectral subtraction system. The improvements to the system are: (1) incorporation of mixture components in the HMM for noise in order to handle noise nonstationarity in a more flexible manner, (2) two efficient methods in the speech enhancement system design that make the system real-time implementable, and (3) an adaptation method to the noise type in order to accommodate a wide variety of noise expected under the enhancement system´s operating environment. The results of the experiments designed to evaluate the performance of the HMM-based speech enhancement systems in comparison with spectral subtraction are reported. Three types of noise-white noise, simulated helicopter noise, and multitalker (cocktail party) noise-were used to corrupt the test speech signals. Both objective (global SNR) and subjective mean opinion score (MOS) evaluations demonstrate consistent superiority of the HMM-based enhancement systems that incorporate the innovations described in this paper over the conventional spectral subtraction method
Keywords :
acoustic noise; hidden Markov models; least mean squares methods; spectral analysis; speech enhancement; white noise; HMM-based speech enhancement system; adaptation method; cocktail party noise; experiments; hidden Markov model; mixture components; multitalker noise; noise nonstationarity; nonstationary noise; real-time system; simulated helicopter noise; spectral subtraction method; test speech signals; white noise; Acoustic noise; Auditory system; Background noise; Digital signal processing; Hidden Markov models; Low-frequency noise; Signal to noise ratio; Speech coding; Speech enhancement; Working environment noise;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on