• DocumentCode
    1161168
  • Title

    Estimation of the short-term predictor parameters of speech under noisy conditions

  • Author

    Kuropatwinski, Marcin ; Kleijn, W. Bastiaan

  • Author_Institution
    R. Inst. of Technol., Stockholm
  • Volume
    14
  • Issue
    5
  • fYear
    2006
  • Firstpage
    1645
  • Lastpage
    1655
  • Abstract
    Speech coding algorithms that have been developed for clean speech are often used in a noisy environment. We describe maximum a posteriori (MAP) and minimum mean square error (MMSE) techniques to estimate the clean-speech short-term predictor (STP) parameters from noisy speech. The MAP and MMSE estimates are obtained using a likelihood function computed by means of the DFT or Kalman filtering and empirical probability distributions based on multidimensional histograms. The method is assessed in terms of the resulting root mean spectral distortion between the "clean" speech STP parameters and the STP parameters computed with the proposed method from noisy speech. The estimated parameters are also applied to obtain clean speech estimates by means of a Kalman filter. The quality of the estimated speech as compared to the "clean" speech is assessed by means of subjective tests, signal-to-noise ratio improvement, and the perceptual speech quality measurement method
  • Keywords
    Kalman filters; discrete Fourier transforms; distortion; least mean squares methods; maximum likelihood estimation; speech coding; statistical distributions; DFT; Kalman filtering; MAP techniques; MMSE techniques; likelihood function; maximum a posteriori techniques; minimum mean square error techniques; multidimensional histograms; noisy conditions; noisy speech; parameter estimation; perceptual speech quality measurement; probability distributions; root mean spectral distortion; short-term predictor parameters; signal-to-noise ratio; speech coding algorithms; Distributed computing; Filtering; Histograms; Kalman filters; Mean square error methods; Multidimensional systems; Parameter estimation; Probability distribution; Speech coding; Working environment noise; Maximum a posteriori estimation; minimum mean square error estimation; noise reduction; probabilistic modeling of speech; speech coding;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TSA.2005.858558
  • Filename
    1677984