DocumentCode
388086
Title
Increased noise immunity in large vocabulary speech recognition with the aid of spectral subtraction
Author
Van Compernolle, Dirk
Author_Institution
IBM T. J. Watson Research Center, Yorktown Heights, NY
Volume
12
fYear
1987
fDate
31868
Firstpage
1143
Lastpage
1146
Abstract
This paper presents several ways of making the signal processing in the IBM speech recognition system more robust with respect to variations in the background noise level. The underlying problem is that the speech recognition system trains on the specific noise circumstances of the training session. A simple solution lays in the controlled addition of noise. The level of noise that has to be added in to effectively mask all background noise is rather high and causes a significant reduction in accuracy. Spectral subtraction does a better job in a limited number of cases, but the thresholding in spectral subtraction often leads to training problems in the hidden Markov model based recognition system. The best results were obtained by reintroducing a semi-natural background by adding noise after applying spectral subtraction.
Keywords
Background noise; Dynamic range; Hidden Markov models; Histograms; Noise level; Signal processing; Speech enhancement; Speech recognition; Vocabulary; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
Type
conf
DOI
10.1109/ICASSP.1987.1169799
Filename
1169799
Link To Document