DocumentCode
1968870
Title
Data driven suppression rule for speech enhancement
Author
Tashev, I. ; Slaney, M.
Author_Institution
Microsoft Res., Redmond, WA, USA
fYear
2013
fDate
10-15 Feb. 2013
Firstpage
1
Lastpage
6
Abstract
Audio signal enhancement often involves the application of a time-varying filter, or suppression rule, to the frequency-domain transform of a corrupted signal. Classic approaches use rules derived under Gaussian models and interpret them as spectral estimators in a Bayesian statistical framework. This mathematical approach provides rules that satisfy certain optimization criteria - maximum likelihood, mean square error, etc. In this paper we propose to learn the suppression rule from a representative training corpus and make it optimal in the sense of best perceived quality. This can be measured, for example, with the wideband PESQ algorithm, for which we cannot derive an analytic estimator. The proposed suppression rule is evaluated in controlled environment and shows improvements in the range of 0.1-0.2 PESQ points on a data corpus with SNRs ranging from -10 to +50 dB.
Keywords
Gaussian processes; audio signal processing; frequency-domain analysis; optimisation; speech enhancement; time-varying filters; Bayesian statistical framework; Gaussian models; SNR; audio signal enhancement; data driven suppression rule; frequency-domain transform; gain -10 dB to 50 dB; mathematical approach; mean square error; optimization criteria; signal corruption; speech enhancement; time-varying filter; wideband PESQ algorithm; Mathematical model; Maximum likelihood estimation; Optimization; Signal to noise ratio; Speech; Speech enhancement; noise suppression; speech enhancement; suppression rule;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Theory and Applications Workshop (ITA), 2013
Conference_Location
San Diego, CA
Print_ISBN
978-1-4673-4648-1
Type
conf
DOI
10.1109/ITA.2013.6502979
Filename
6502979
Link To Document