DocumentCode :
3161658
Title :
Indirect model-based speech enhancement
Author :
Roux, Jonathan Le ; Hershey, John R.
Author_Institution :
Mitsubishi Electr. Res. Labs. (MERL), Cambridge, MA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4045
Lastpage :
4048
Abstract :
Model-based speech enhancement methods, such as vector-Taylor series-based methods (VTS) [1, 2], share a common methodology: they estimate speech using the expected value of the clean speech given the noisy speech under a statistical model. We show that it may be better to use the expected value of the noise under the model and subtract it from the noisy observation to form an indirect estimate of the speech. Interestingly, for VTS, this methodology turns out to be related to the application of an SNR-dependent gain to the direct VTS speech estimate. In results obtained on an automotive noise task, this methodology produces an average improvement of 1.6 dB signal-to-noise ratio (SNR), relative to conventional methods.
Keywords :
speech enhancement; statistical analysis; SNR-dependent gain; automotive noise task; direct VTS speech estimation; expected value; indirect model-based speech enhancement; signal-to-noise ratio; speech indirect estimation; statistical model; vector-Taylor series-based methods; Computational modeling; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Speech recognition; Algonquin; Speech enhancement; VTS; log spectrum; vector Taylor series;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288806
Filename :
6288806
Link To Document :
بازگشت