Estimation of the short-term predictor parameters of speech under noisy conditions

Author

Kuropatwinski, Marcin ; Kleijn, W. Bastiaan

Author_Institution

R. Inst. of Technol., Stockholm

Volume

14

Issue

5

fYear

2006

Firstpage

1645

Lastpage

1655

Abstract

Speech coding algorithms that have been developed for clean speech are often used in a noisy environment. We describe maximum a posteriori (MAP) and minimum mean square error (MMSE) techniques to estimate the clean-speech short-term predictor (STP) parameters from noisy speech. The MAP and MMSE estimates are obtained using a likelihood function computed by means of the DFT or Kalman filtering and empirical probability distributions based on multidimensional histograms. The method is assessed in terms of the resulting root mean spectral distortion between the "clean" speech STP parameters and the STP parameters computed with the proposed method from noisy speech. The estimated parameters are also applied to obtain clean speech estimates by means of a Kalman filter. The quality of the estimated speech as compared to the "clean" speech is assessed by means of subjective tests, signal-to-noise ratio improvement, and the perceptual speech quality measurement method

Keywords

Kalman filters; discrete Fourier transforms; distortion; least mean squares methods; maximum likelihood estimation; speech coding; statistical distributions; DFT; Kalman filtering; MAP techniques; MMSE techniques; likelihood function; maximum a posteriori techniques; minimum mean square error techniques; multidimensional histograms; noisy conditions; noisy speech; parameter estimation; perceptual speech quality measurement; probability distributions; root mean spectral distortion; short-term predictor parameters; signal-to-noise ratio; speech coding algorithms; Distributed computing; Filtering; Histograms; Kalman filters; Mean square error methods; Multidimensional systems; Parameter estimation; Probability distribution; Speech coding; Working environment noise; Maximum a posteriori estimation; minimum mean square error estimation; noise reduction; probabilistic modeling of speech; speech coding;

fLanguage

English

Journal_Title

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher

ieee

ISSN

1558-7916

Type

jour

DOI

10.1109/TSA.2005.858558

Filename

1677984