DocumentCode
730101
Title
Utilizing spectro-temporal correlations for an improved speech presence probability based noise power estimation
Author
Krawczyk-Becker, Martin ; Fischer, Dorte ; Gerkmann, Timo
Author_Institution
Dept. of Med. Phys. & Acoust. & Cluster of Excellence “Hearing4all”, Univ. of Oldenburg, Oldenburg, Germany
fYear
2015
fDate
19-24 April 2015
Firstpage
365
Lastpage
369
Abstract
For the enhancement of speech degraded by noise, accurate estimation of the noise power spectral density (PSD) is indispensable, especially if only a single microphone signal is available. Fast and accurate tracking of the noise PSD is particularly challenging in highly non-stationary noise, since speech and noise components become harder to distinguish. Short-time discrete Fourier transform (STFT) based noise PSD estimation algorithms that employ estimates of the speech presence probability (SPP) with fixed priors have been shown to yield good tracking performance even in adverse noise conditions. In this paper, we compare two methods that incorporate spectro-temporal correlations to improve the tracking performance. The first method smooths the noisy observation over time and frequency before computing the SPP, while the second is based on a hidden Markov model (HMM) of the speech presence and absence states. We show that the proposed modifications lead to improved noise PSD estimators that are less sensitive to spectral outliers of the noise and track changes in the noise PSD more quickly than the reference method. Further, when employed in a common speech enhancement setup, the proposed estimators achieve increased noise reduction while keeping speech distortions at a comparable level.
Keywords
discrete Fourier transforms; hidden Markov models; probability; speech enhancement; HMM; SPP; STFT based noise PSD estimation algorithms; hidden Markov model; noise PSD estimators; noise power spectral density; non-stationary noise types; short-time discrete Fourier transform based noise PSD estimation algorithms; spectral outliers; spectro-temporal correlations; speech distortions; speech enhancement setup; speech presence probability; Estimation; Hidden Markov models; Signal to noise ratio; Speech; Speech enhancement; Time-frequency analysis; noise power estimation; noise reduction; speech enhancement;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7177992
Filename
7177992