DocumentCode
336783
Title
On the limits of speech recognition in noise
Author
Peters, S. Douglas ; Stubley, Peter ; Valin, Jean-Marc
Author_Institution
Nortel Technol., Montreal, Que., Canada
Volume
1
fYear
1999
fDate
15-19 Mar 1999
Firstpage
365
Abstract
We consider the performance of speech recognition in noise and focus on its sensitivity to the acoustic feature set. In particular, we examine the perceived information reduction imposed on a speech signal by a feature extraction method commonly used for automatic speech recognition. We observe that the human recognition rates on noisy digit strings drop considerably as the speech signal undergoes the typical loss of phase and loss of frequency resolution. Steps are taken to ensure that human subjects are constrained in ways similar to those of an automatic recognizer. The high correlation between the performance of the human listeners and that of our connected digit recognizer leads us to some interesting conclusions, including that typical cepstral processing is insufficient to support speech information in noise.
Keywords
acoustic signal processing; cepstral analysis; feature extraction; noise; speech processing; speech recognition; acoustic feature set; automatic speech recognition; cepstral processing; connected digit recognizer; correlation; feature extraction method; frequency resolution loss; human listeners; human recognition rates; noisy digit strings; perceived information reduction; performance; phase loss; speech signal; Acoustic noise; Automatic speech recognition; Cepstral analysis; Feature extraction; Frequency; Humans; Phase noise; Signal resolution; Speech enhancement; Speech recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location
Phoenix, AZ
ISSN
1520-6149
Print_ISBN
0-7803-5041-3
Type
conf
DOI
10.1109/ICASSP.1999.758138
Filename
758138