Title :
On the limits of speech recognition in noise
Author :
Peters, S. Douglas ; Stubley, Peter ; Valin, Jean-Marc
Author_Institution :
Nortel Technol., Montreal, Que., Canada
Abstract :
We consider the performance of speech recognition in noise and focus on its sensitivity to the acoustic feature set. In particular, we examine the perceived information reduction imposed on a speech signal by a feature extraction method commonly used for automatic speech recognition. We observe that human recognition rates on noisy digit strings drop considerably as the speech signal undergoes the typical loss of phase and loss of frequency resolution. Steps are taken to ensure that human subjects are constrained in ways similar to those of an automatic recognizer. The high correlation between the performance of the human listeners and that of our connected digit recognizer leads us to some interesting conclusions, including that typical cepstral processing is insufficient to support speech information in noise.
Keywords :
acoustic signal processing; cepstral analysis; feature extraction; noise; speech processing; speech recognition; acoustic feature set; automatic speech recognition; cepstral processing; connected digit recognizer; correlation; feature extraction method; frequency resolution loss; human listeners; human recognition rates; noisy digit strings; perceived information reduction; performance; phase loss; speech signal; Acoustic noise; Automatic speech recognition; Cepstral analysis; Feature extraction; Frequency; Humans; Phase noise; Signal resolution; Speech enhancement; Speech recognition;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758138