• DocumentCode
    417239
  • Title

    Speech enhancement by perceptual filter with sequential noise parameter estimation

  • Author

    Lee, Te-Won ; Yao, Kaisheng

  • Author_Institution
    Inst. for Neural Comput., Univ. of California, San Diego, CA, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    We report work on speech enhancement that combines sequential noise estimation and perceptual filtering. The sequential estimation employs an extension of the sequential EM-type algorithm. In the algorithm, statistics of clean speech are modeled by hidden Markov models (HMM) and noise is assumed to be Gaussian distributed with a time-varying mean vector (the noise parameter) to be estimated. The estimation process uses a non-linear function that relates speech statistics, noise, and noisy observation. With the estimated noise parameter, the subtraction-type algorithm for speech enhancement may be extended to non-stationary environments. In particular, a perceptual filter with frequency masking is constructed with a tradeoff between noise reduction and speech distortion considering the sensitivity of speech recognition systems to speech distortion. Our experiments in speech enhancement and speech recognition in non-stationary noise confirmed that this approach seems promising in improving performances compared to alternative speech enhancement algorithms.
  • Keywords
    Gaussian noise; acoustic noise; filtering theory; hidden Markov models; maximum likelihood estimation; optimisation; parameter estimation; sequential estimation; speech enhancement; speech recognition; EM algorithm; Gaussian distribution; Gaussian noise; HMM; hidden Markov models; maximum likelihood estimation; nonlinear function; perceptual filtering; sequential estimation; sequential noise parameter estimation; speech distortion; speech enhancement; speech recognition; speech statistics; time-varying mean vector; Filters; Gaussian noise; Hidden Markov models; Nonlinear distortion; Parameter estimation; Speech coding; Speech enhancement; Speech recognition; Statistical distributions; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326080
  • Filename
    1326080