Title :
Estimation of the short-term predictor parameters of speech under noisy conditions
Author :
Kuropatwinski, Marcin ; Kleijn, W. Bastiaan
Author_Institution :
R. Inst. of Technol., Stockholm
Abstract :
Speech coding algorithms that have been developed for clean speech are often used in a noisy environment. We describe maximum a posteriori (MAP) and minimum mean square error (MMSE) techniques to estimate the clean-speech short-term predictor (STP) parameters from noisy speech. The MAP and MMSE estimates are obtained using a likelihood function computed by means of the DFT or Kalman filtering and empirical probability distributions based on multidimensional histograms. The method is assessed in terms of the resulting root mean spectral distortion between the "clean" speech STP parameters and the STP parameters computed with the proposed method from noisy speech. The estimated parameters are also applied to obtain clean speech estimates by means of a Kalman filter. The quality of the estimated speech as compared to the "clean" speech is assessed by means of subjective tests, signal-to-noise ratio improvement, and the perceptual speech quality measurement method
Keywords :
Kalman filters; discrete Fourier transforms; distortion; least mean squares methods; maximum likelihood estimation; speech coding; statistical distributions; DFT; Kalman filtering; MAP techniques; MMSE techniques; likelihood function; maximum a posteriori techniques; minimum mean square error techniques; multidimensional histograms; noisy conditions; noisy speech; parameter estimation; perceptual speech quality measurement; probability distributions; root mean spectral distortion; short-term predictor parameters; signal-to-noise ratio; speech coding algorithms; Distributed computing; Filtering; Histograms; Kalman filters; Mean square error methods; Multidimensional systems; Parameter estimation; Probability distribution; Speech coding; Working environment noise; Maximum a posteriori estimation; minimum mean square error estimation; noise reduction; probabilistic modeling of speech; speech coding;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TSA.2005.858558