DocumentCode :
417293
Title :
Universal compensation -- an approach to noisy speech recognition assuming no knowledge of noise
Author :
Ming, Ji
Author_Institution :
Sch. of Comput. Sci., Queen´´s Univ., Belfast, UK
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
We aim to develop an acoustic model for noisy speech recognition that is "trained once, suits all", in terms of offering a recognition performance close to the matched training-testing condition performance based only on clean speech training data. This paper describes such a method termed universal compensation (UC), for its ability to accommodate arbitrary additive noise without assuming any knowledge about the noise. The new UC method consists of two parts: 1) converting full-band spectral corruption into partial-band spectral corruption through compensations for simulated wide-band flat-spectrum noise at consecutive SNRs (signal-to-noise ratios); and 2) reducing the effect of the remaining partial frequency-band corruption on recognition by ignoring the severely mismatched spectral components and basing the recognition mainly on the matched or appropriately compensated spectral components. Experiments on Aurora 2 indicate that the new model, trained from clean data, has achieved a performance comparable to the performance obtained by the baseline system trained on multi-condition data; experiments with noises unseen in Aurora 2 have shown significant improvement for the new model over the baseline model with multi-condition training.
Keywords :
acoustic noise; compensation; speech recognition; SNR; additive background noise; clean data trained model; compensated spectral components; full-band spectral corruption; matched spectral components; noisy speech acoustic model; noisy speech recognition; partial-band spectral corruption; signal-to-noise ratio; universal noise compensation method; unknown arbitrary additive noise; wide-band flat-spectrum noise; Acoustic noise; Additive noise; Background noise; Noise reduction; Speech enhancement; Speech recognition; Testing; Training data; Wiener filter; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326147
Filename :
1326147
Link To Document :
بازگشت