DocumentCode
1161168
Title
Estimation of the short-term predictor parameters of speech under noisy conditions
Author
Kuropatwinski, Marcin ; Kleijn, W. Bastiaan
Author_Institution
R. Inst. of Technol., Stockholm
Volume
14
Issue
5
fYear
2006
Firstpage
1645
Lastpage
1655
Abstract
Speech coding algorithms that have been developed for clean speech are often used in a noisy environment. We describe maximum a posteriori (MAP) and minimum mean square error (MMSE) techniques to estimate the clean-speech short-term predictor (STP) parameters from noisy speech. The MAP and MMSE estimates are obtained using a likelihood function computed by means of the DFT or Kalman filtering and empirical probability distributions based on multidimensional histograms. The method is assessed in terms of the resulting root mean spectral distortion between the "clean" speech STP parameters and the STP parameters computed with the proposed method from noisy speech. The estimated parameters are also applied to obtain clean speech estimates by means of a Kalman filter. The quality of the estimated speech as compared to the "clean" speech is assessed by means of subjective tests, signal-to-noise ratio improvement, and the perceptual speech quality measurement method
Keywords
Kalman filters; discrete Fourier transforms; distortion; least mean squares methods; maximum likelihood estimation; speech coding; statistical distributions; DFT; Kalman filtering; MAP techniques; MMSE techniques; likelihood function; maximum a posteriori techniques; minimum mean square error techniques; multidimensional histograms; noisy conditions; noisy speech; parameter estimation; perceptual speech quality measurement; probability distributions; root mean spectral distortion; short-term predictor parameters; signal-to-noise ratio; speech coding algorithms; Distributed computing; Filtering; Histograms; Kalman filters; Mean square error methods; Multidimensional systems; Parameter estimation; Probability distribution; Speech coding; Working environment noise; Maximum a posteriori estimation; minimum mean square error estimation; noise reduction; probabilistic modeling of speech; speech coding;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TSA.2005.858558
Filename
1677984
Link To Document