مرکز منطقه ای اطلاع رساني علوم و فناوري - Integrating Kalman filtering and multi-pulse coding for speech enhancement with a non-stationary model of the speech signal

DocumentCode :

2654691

Title :

Integrating Kalman filtering and multi-pulse coding for speech enhancement with a non-stationary model of the speech signal

Author :

Li, Chunjian ; Andersen, Søren Vang

Author_Institution :

Dept. of Commun. Technol., Aalborg Univ., Denmark

Volume :

fYear :

2004

fDate :

7-10 Nov. 2004

Firstpage :

2300

Abstract :

In this paper, speech enhancement via Kalman filtering is considered. A non-stationary signal model for the speech signal is first described. This model consists of a slowly varying AR model and an excitation source that exhibits a rapidly time-varying variance. The AR model and the excitation model fit nicely into the Kalman filtering framework, fully exploiting the capability of the Kalman filter to process non-stationary signals in an LMMSE optimum manner. The AR-model coefficients are estimated by a decision-directed type power spectral subtraction method followed by an LPC analysis. For the robust estimation of the rapidly time-varying excitation model in the presence of noise, we propose the use of a multi-pulse linear predictive coding (MPLPC) based method. The Kalman filtering algorithm based on the non-stationary signal model is able to partially avoid the commonly used quasi-stationarity assumption of the speech. Therefore the non-stationarity of the signal is fully exploited in suppressing the noise power that is more stationary. Our experiments show that the Kalman filter with rapidly time-varying variance modeling using the proposed MPLPC based method brings significant performance improvement both when compared to a baseline Kalman filtering method with quasi-stationarity assumption and when compared to the well-known MMSE log-spectral amplitude estimator (MMSE-LSA).

Keywords :

Kalman filters; amplitude estimation; filtering theory; least mean squares methods; linear codes; signal denoising; spectral analysis; speech coding; speech enhancement; Kalman filtering; LMMSE optimum manner; excitation source; log-spectral amplitude estimator; multipulse linear predictive coding; noise power suppression; power spectral subtraction method; robust estimation; speech enhancement; speech quasi-stationarity assumption; speech signal nonstationary model; time-varying excitation model; time-varying variance; Filtering; Kalman filters; Linear predictive coding; Noise robustness; Signal processing; Signal resolution; Speech analysis; Speech coding; Speech enhancement; Wiener filter;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signals, Systems and Computers, 2004. Conference Record of the Thirty-Eighth Asilomar Conference on

Print_ISBN :

0-7803-8622-1

Type :

conf

DOI :

10.1109/ACSSC.2004.1399578

Filename :

1399578

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2654691