DocumentCode :
1420398
Title :
HMM-based strategies for enhancement of speech signals embedded in nonstationary noise
Author :
Sameti, Hossein ; Sheikhzadeh, Hamid ; Deng, Li ; Brennan, Robert L.
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
Volume :
6
Issue :
5
fYear :
1998
fDate :
9/1/1998 12:00:00 AM
Firstpage :
445
Lastpage :
455
Abstract :
An improved hidden Markov model-based (HMM-based) speech enhancement system designed using the minimum mean square error principle is implemented and compared with a conventional spectral subtraction system. The improvements to the system are: (1) incorporation of mixture components in the HMM for noise in order to handle noise nonstationarity in a more flexible manner, (2) two efficient methods in the speech enhancement system design that make the system real-time implementable, and (3) an adaptation method to the noise type in order to accommodate a wide variety of noise expected under the enhancement system´s operating environment. The results of the experiments designed to evaluate the performance of the HMM-based speech enhancement systems in comparison with spectral subtraction are reported. Three types of noise-white noise, simulated helicopter noise, and multitalker (cocktail party) noise-were used to corrupt the test speech signals. Both objective (global SNR) and subjective mean opinion score (MOS) evaluations demonstrate consistent superiority of the HMM-based enhancement systems that incorporate the innovations described in this paper over the conventional spectral subtraction method
Keywords :
acoustic noise; hidden Markov models; least mean squares methods; spectral analysis; speech enhancement; white noise; HMM-based speech enhancement system; adaptation method; cocktail party noise; experiments; hidden Markov model; mixture components; multitalker noise; noise nonstationarity; nonstationary noise; real-time system; simulated helicopter noise; spectral subtraction method; test speech signals; white noise; Acoustic noise; Auditory system; Background noise; Digital signal processing; Hidden Markov models; Low-frequency noise; Signal to noise ratio; Speech coding; Speech enhancement; Working environment noise;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.709670
Filename :
709670
Link To Document :
بازگشت