• DocumentCode
    57031
  • Title

    Robust Estimation of Non-Stationary Noise Power Spectrum for Speech Enhancement

  • Author

    Van-Khanh Mai ; Pastor, Dominique ; Aissa-El-Bey, Abdeldjalil ; Le-Bidan, Raphael

  • Author_Institution
    Lab.-STIC, Univ. Eur. de Bretagne, Brest, France
  • Volume
    23
  • Issue
    4
  • fYear
    2015
  • fDate
    Apr-15
  • Firstpage
    670
  • Lastpage
    682
  • Abstract
    We propose a novel method for noise power spectrum estimation in speech enhancement. This method called extended-DATE (E-DATE) extends the d-dimensional amplitude trimmed estimator (DATE), originally introduced for additive white gaussian noise power spectrum estimation in “Robust estimation of noise standard deviation in presence of signals with unknown distributions and occurrences” (D. Pastor and F. Socheleau, IEEE Trans. Signal Processing, vol. 60, no. 4, pp. 1545-1555, Apr. 2012) to the more challenging scenario of non-stationary noise. The key idea is that, in each frequency bin and within a sufficiently short time period, the noise instantaneous power spectrum can be considered as approximately constant and estimated as the variance of a complex gaussian noise process possibly observed in the presence of the signal of interest. The proposed method relies on the fact that the Short-Time Fourier Transform (STFT) of noisy speech signals is sparse in the sense that transformed speech signals can be represented by a relatively small number of coefficients with large amplitudes in the time-frequency domain. The E-DATE estimator is robust in that it does not require prior information about the signal probability distribution except for the weak-sparseness property. In comparison to other state-of-the-art methods, the E-DATE is found to require the smallest number of parameters (only two). The performance of the proposed estimator has been evaluated in combination with noise reduction and compared to alternative methods. This evaluation involves objective as well as pseudo-subjective criteria.
  • Keywords
    AWGN; Fourier transforms; Gaussian processes; estimation theory; interference suppression; signal denoising; speech enhancement; statistical distributions; E-DATE; Gaussian noise process; STFT; additive white Gaussian noise power spectrum estimation; d-dimensional amplitude trimmed estimator; extended-DATE; noise reduction; noisy speech signals; nonstationary noise power spectrum; robust estimation; short-time Fourier transform; signal probability distribution; speech enhancement; time-frequency domain; weak-sparseness property; Noise; Noise measurement; Smoothing methods; Spectral analysis; Speech; Speech enhancement; Speech enhancement; noise power spectrum estimation; noise reduction; robust statistics;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2401426
  • Filename
    7035013