DocumentCode :
1968870
Title :
Data driven suppression rule for speech enhancement
Author :
Tashev, I. ; Slaney, M.
Author_Institution :
Microsoft Res., Redmond, WA, USA
fYear :
2013
fDate :
10-15 Feb. 2013
Firstpage :
1
Lastpage :
6
Abstract :
Audio signal enhancement often involves the application of a time-varying filter, or suppression rule, to the frequency-domain transform of a corrupted signal. Classic approaches use rules derived under Gaussian models and interpret them as spectral estimators in a Bayesian statistical framework. This mathematical approach provides rules that satisfy certain optimization criteria - maximum likelihood, mean square error, etc. In this paper we propose to learn the suppression rule from a representative training corpus and make it optimal in the sense of best perceived quality. This can be measured, for example, with the wideband PESQ algorithm, for which we cannot derive an analytic estimator. The proposed suppression rule is evaluated in controlled environment and shows improvements in the range of 0.1-0.2 PESQ points on a data corpus with SNRs ranging from -10 to +50 dB.
Keywords :
Gaussian processes; audio signal processing; frequency-domain analysis; optimisation; speech enhancement; time-varying filters; Bayesian statistical framework; Gaussian models; SNR; audio signal enhancement; data driven suppression rule; frequency-domain transform; gain -10 dB to 50 dB; mathematical approach; mean square error; optimization criteria; signal corruption; speech enhancement; time-varying filter; wideband PESQ algorithm; Mathematical model; Maximum likelihood estimation; Optimization; Signal to noise ratio; Speech; Speech enhancement; noise suppression; speech enhancement; suppression rule;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory and Applications Workshop (ITA), 2013
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-4648-1
Type :
conf
DOI :
10.1109/ITA.2013.6502979
Filename :
6502979
Link To Document :
بازگشت