Title :
Unsupervised spectral subtraction for noise-robust ASR
Author :
Lathoud, Guillaume ; Doss, Mathew Magimai ; Mesot, Bertrand ; Bourlard, Hervé
Author_Institution :
IDIAP Res. Inst., Martigny
Abstract :
This paper proposes a simple, computationally efficient 2-mixture model approach to discriminate between speech and background noise at the magnitude spectrogram level. It is directly derived from observations on real data, and can be used in a fully unsupervised manner, with the EM algorithm. In this paper, the 2-mixture model is used in an "unsupervised spectral subtraction" scheme that can be applied as a pre-processing step for any acoustic feature extraction scheme, such as MFCCs or PLP. The goal is to improve noise-robustness of the acoustic features. Experimental results on both OGI Numbers 95 and Aurora 2 tasks yielded a major improvement on all noise conditions, while retaining a similar performance on clean conditions
Keywords :
expectation-maximisation algorithm; feature extraction; signal denoising; speech enhancement; speech recognition; EM algorithm; acoustic feature extraction; automatic speech recognition; background noise; noise-robustness; unsupervised spectral subtraction; Acoustic noise; Automatic speech recognition; Background noise; Computational modeling; Feature extraction; Frequency estimation; Noise robustness; Spectrogram; Speech enhancement; Speech processing;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location :
San Juan
Print_ISBN :
0-7803-9478-X
Electronic_ISBN :
0-7803-9479-8
DOI :
10.1109/ASRU.2005.1566500