DocumentCode
284652
Title
Spectral contrast normalization and other techniques for speech recognition in noise
Author
Bateman, D.C. ; Bye, D.K. ; Hunt, M.J.
Author_Institution
Marconi Speech & Information Systems, Portsmouth, UK
Volume
1
fYear
1992
fDate
23-26 Mar 1992
Firstpage
241
Abstract
Three methods of improving speech recognition in noise are considered: energy thresholding, a noise-robust spectral representation called IMELDA, and a set of noise-robust spectral distortion measures. The spectral distortion measures can be seen as normalizing the contrast in the spectrum, a problem which can be transferred to the representation itself, making it computationally more efficient. In speaker-independent alphabet recognition tests in added steady white noise at various levels, IMELDA is shown to outperform a weighted cepstrum representation and be computationally more efficient. With this material and with digits recorded in trucks at a wide range of noise levels, performance is found to depend strongly on the threshold level. Contrast normalization is found to help, but only when the energy threshold is far from its optimum level
Keywords
spectral analysis; speech recognition; white noise; IMELDA; energy thresholding; noise-robust spectral distortion measures; noise-robust spectral representation; performance; speaker-independent alphabet recognition tests; spectral contrast normalization; speech recognition; white noise; Acoustic distortion; Acoustic measurements; Acoustic noise; Distortion measurement; Noise level; Noise measurement; Noise robustness; Speech enhancement; Speech recognition; White noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location
San Francisco, CA
ISSN
1520-6149
Print_ISBN
0-7803-0532-9
Type
conf
DOI
10.1109/ICASSP.1992.225927
Filename
225927
Link To Document