DocumentCode :
2654691
Title :
Integrating Kalman filtering and multi-pulse coding for speech enhancement with a non-stationary model of the speech signal
Author :
Li, Chunjian ; Andersen, Søren Vang
Author_Institution :
Dept. of Commun. Technol., Aalborg Univ., Denmark
Volume :
2
fYear :
2004
fDate :
7-10 Nov. 2004
Firstpage :
2300
Abstract :
In this paper, speech enhancement via Kalman filtering is considered. A non-stationary signal model for the speech signal is first described. This model consists of a slowly varying AR model and an excitation source that exhibits a rapidly time-varying variance. The AR model and the excitation model fit nicely into the Kalman filtering framework, fully exploiting the capability of the Kalman filter to process non-stationary signals in an LMMSE optimum manner. The AR-model coefficients are estimated by a decision-directed type power spectral subtraction method followed by an LPC analysis. For the robust estimation of the rapidly time-varying excitation model in the presence of noise, we propose the use of a multi-pulse linear predictive coding (MPLPC) based method. The Kalman filtering algorithm based on the non-stationary signal model is able to partially avoid the commonly used quasi-stationarity assumption of the speech. Therefore the non-stationarity of the signal is fully exploited in suppressing the noise power that is more stationary. Our experiments show that the Kalman filter with rapidly time-varying variance modeling using the proposed MPLPC based method brings significant performance improvement both when compared to a baseline Kalman filtering method with quasi-stationarity assumption and when compared to the well-known MMSE log-spectral amplitude estimator (MMSE-LSA).
Keywords :
Kalman filters; amplitude estimation; filtering theory; least mean squares methods; linear codes; signal denoising; spectral analysis; speech coding; speech enhancement; Kalman filtering; LMMSE optimum manner; excitation source; log-spectral amplitude estimator; multipulse linear predictive coding; noise power suppression; power spectral subtraction method; robust estimation; speech enhancement; speech quasi-stationarity assumption; speech signal nonstationary model; time-varying excitation model; time-varying variance; Filtering; Kalman filters; Linear predictive coding; Noise robustness; Signal processing; Signal resolution; Speech analysis; Speech coding; Speech enhancement; Wiener filter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers, 2004. Conference Record of the Thirty-Eighth Asilomar Conference on
Print_ISBN :
0-7803-8622-1
Type :
conf
DOI :
10.1109/ACSSC.2004.1399578
Filename :
1399578
Link To Document :
بازگشت