DocumentCode :
2773475
Title :
Efficient post-processing techniques for speech enhancement
Author :
Ramakrishnan, Vyass ; Shetty, Karthik ; Pawan Kumar, G. ; Seelamantula, Chandra Sekhar
Author_Institution :
Dept. of Telecommun. Eng., M.S. Ramaiah Inst. of Technol., Bangalore, India
fYear :
2011
fDate :
28-30 Jan. 2011
Firstpage :
1
Lastpage :
5
Abstract :
We address the problem of speech enhancement in real-world noisy scenarios. We propose to solve the problem in two stages, the first comprising a generalized spectral subtraction technique, followed by a sequence of perceptually-motivated post-processing algorithms. The role of the post-processing algorithms is to compensate for the effects of noise as well as to suppress any artifacts created by the first-stage processing. The key post-processing mechanisms are aimed at suppressing musical noise and to enhance the formant structure of voiced speech as well as to denoise the linear-prediction residual. The parameter values in the techniques are fixed optimally by experimentally evaluating the enhancement performance as a function of the parameters. We used the Carnegie-Mellon university Arctic database for our experiments. We considered three real-world noise types: fan noise, car noise, and motorbike noise. The enhancement performance was evaluated by conducting listening experiments on 12 subjects. The listeners reported a clear improvement (MOS improvement of 0.5 on an average) over the noisy signal in the perceived quality (increase in the mean-opinion score (MOS)) for positive signal-to-noise-ratios (SNRs). For negative SNRs, however, the improvement was found to be marginal.
Keywords :
speech enhancement; Carnegie-Mellon university Arctic database; car noise; fan noise; generalized spectral subtraction technique; linear-prediction residual denoising; mean-opinion score; motorbike noise; musical noise suppression; perceptually-motivated post-processing algorithms; real-world noisy scenarios; signal-to-noise-ratios; speech enhancement; Acoustics; Noise measurement; Signal to noise ratio; Speech; Speech enhancement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications (NCC), 2011 National Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-61284-090-1
Type :
conf
DOI :
10.1109/NCC.2011.5734780
Filename :
5734780
Link To Document :
بازگشت