• DocumentCode
    730101
  • Title
    Utilizing spectro-temporal correlations for an improved speech presence probability based noise power estimation
  • Author
    Krawczyk-Becker, Martin; Fischer, Dorte; Gerkmann, Timo
  • Author_Institution
    Dept. of Med. Phys. & Acoust. & Cluster of Excellence “Hearing4all”, Univ. of Oldenburg, Oldenburg, Germany
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    365
  • Lastpage
    369
  • Abstract
    For the enhancement of speech degraded by noise, accurate estimation of the noise power spectral density (PSD) is indispensable, especially if only a single microphone signal is available. Fast and accurate tracking of the noise PSD is particularly challenging in highly non-stationary noise types, since the distinction between speech and noise components becomes more difficult. Short-time discrete Fourier transform (STFT) based noise PSD estimation algorithms which employ estimates of the speech presence probability (SPP) with fixed priors have been shown to yield good tracking performance even in adverse noise conditions. In this paper, we compare two methods to incorporate spectro-temporal correlations to improve the tracking performance. The first method smooths the noisy observation over time and frequency before computing the SPP, while the second is based on a Hidden Markov Model (HMM) of the speech presence and absence states. We show that the proposed modifications lead to improved noise PSD estimators which are less sensitive to spectral outliers of the noise and track changes in the noise PSD more quickly than the reference method. Further, when employed in a common speech enhancement setup, the proposed estimators achieve an increased noise reduction while keeping speech distortions at a comparable level.
  • Keywords
    discrete Fourier transforms; hidden Markov models; probability; speech enhancement; HMM; SPP; STFT based noise PSD estimation algorithms; hidden Markov model; noise PSD estimators; noise power spectral density; non-stationary noise types; short-time discrete Fourier transform based noise PSD estimation algorithms; spectral outliers; spectro-temporal correlations; speech distortions; speech enhancement setup; speech presence probability; Estimation; Hidden Markov models; Signal to noise ratio; Speech; Speech enhancement; Time-frequency analysis; noise power estimation; noise reduction
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type
    conf
  • DOI
    10.1109/ICASSP.2015.7177992
  • Filename
    7177992