Speech in Noisy Environments: robust automatic segmentation, feature extraction, and hypothesis combination

Author

Singh, Rita ; Seltzer, Michael L. ; Raj, Bhiksha ; Stern, Richard M.

Author_Institution

Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume

1

fYear

2001

fDate

2001

Firstpage

273

Abstract

The first evaluation for Speech in Noisy Environments (SPINE1) was conducted by the Naval Research Labs (NRL) in August, 2000. The purpose of the evaluation was to test existing core speech recognition technologies for speech in the presence of varying types and levels of noise. In this case the noises were taken from military settings. Among the strategies used by Carnegie Mellon University´s successful systems designed for this task were session-adaptive segmentation, robust mel-scale filtering for the computation of cepstra, the use of parallel front-end features and noise-compensation algorithms, and parallel hypotheses combination through word-graphs. This paper describes the motivations behind the design decisions taken for these components, supported by observations and experiments

Keywords

acoustic noise; cepstral analysis; compensation; digital filters; feature extraction; speech recognition; SPINEl; Speech in Noisy Environments; cepstra; design decisions; feature extraction; hypothesis combination; military settings; noise-compensation algorithms; parallel front-end features; parallel hypotheses combination; robust automatic segmentation; robust mel-scale filtering; session -adaptive segmentation; speech recognition; word-graphs; Algorithm design and analysis; Filtering; Military computing; Noise level; Noise robustness; Speech analysis; Speech enhancement; Speech recognition; Testing; Working environment noise;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on

Conference_Location

Salt Lake City, UT

ISSN

1520-6149

Print_ISBN

0-7803-7041-4

Type

conf

DOI

10.1109/ICASSP.2001.940820

Filename

940820