Title :
Speech in Noisy Environments: robust automatic segmentation, feature extraction, and hypothesis combination
Author :
Singh, Rita ; Seltzer, Michael L. ; Raj, Bhiksha ; Stern, Richard M.
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
The first evaluation for Speech in Noisy Environments (SPINE1) was conducted by the Naval Research Labs (NRL) in August, 2000. The purpose of the evaluation was to test existing core speech recognition technologies for speech in the presence of varying types and levels of noise. In this case the noises were taken from military settings. Among the strategies used by Carnegie Mellon University´s successful systems designed for this task were session-adaptive segmentation, robust mel-scale filtering for the computation of cepstra, the use of parallel front-end features and noise-compensation algorithms, and parallel hypotheses combination through word-graphs. This paper describes the motivations behind the design decisions taken for these components, supported by observations and experiments
Keywords :
acoustic noise; cepstral analysis; compensation; digital filters; feature extraction; speech recognition; SPINEl; Speech in Noisy Environments; cepstra; design decisions; feature extraction; hypothesis combination; military settings; noise-compensation algorithms; parallel front-end features; parallel hypotheses combination; robust automatic segmentation; robust mel-scale filtering; session -adaptive segmentation; speech recognition; word-graphs; Algorithm design and analysis; Filtering; Military computing; Noise level; Noise robustness; Speech analysis; Speech enhancement; Speech recognition; Testing; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940820