Title :
Investigations into the relationship between measurable speech quality and speech recognition rate for telephony speech
Author :
Sun, Hanwu ; Shue, Louis ; Chen, Jianfeng
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
Abstract :
This paper investigates whether the performance of a telephony speech recognition system can be related to the objective speech quality assessment method described in ITU-T Recommendation P.862, known as Perceptual Evaluation of Speech Quality (PESQ). Experiments were conducted with various additive background noises and at different separations between the microphone and the sound source. The preliminary results suggest that telephony speech recognition rates can be mapped to the mean opinion score (MOS) obtained by PESQ using a relatively simple polynomial relationship. This indicates that the PESQ MOS can act as a reliable predictor of the achievable recognition rate for telephony-based speech recognition systems.
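The polynomial mapping mentioned in the abstract could be sketched as an ordinary least-squares fit of recognition rate against PESQ MOS. The abstract gives neither the polynomial degree nor any coefficients, so the degree of 2, the function names `fit_polynomial` and `predict`, and the data points below are all illustrative assumptions; the data are synthetic placeholders generated from an arbitrary quadratic, not results from the paper.

```python
# Hypothetical sketch: least-squares polynomial mapping from PESQ MOS to
# speech recognition rate. Degree and data are assumptions, not taken
# from the paper.

def fit_polynomial(xs, ys, degree):
    """Return coefficients c[0..degree] minimizing sum((p(x) - y)**2),
    where p(x) = c[0] + c[1]*x + ... + c[degree]*x**degree."""
    n = degree + 1
    # Normal equations (V^T V) c = V^T y, with V the Vandermonde matrix.
    a = [[sum(x ** (i + j) for x in xs) for j in range(n)] for i in range(n)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for k in range(col, n):
                a[row][k] -= factor * a[col][k]
            b[row] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(a[i][k] * coeffs[k] for k in range(i + 1, n))
        coeffs[i] = (b[i] - s) / a[i][i]
    return coeffs


def predict(coeffs, x):
    """Evaluate the fitted polynomial at x using Horner's rule."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * x + c
    return result


# Synthetic placeholder data: MOS values and rates (%) generated from an
# arbitrary quadratic, used only to exercise the fit.
mos = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5]
rate = [10.0 + 5.0 * m + 3.0 * m * m for m in mos]

coeffs = fit_polynomial(mos, rate, 2)
predicted_rate = predict(coeffs, 3.0)
```

With measured (MOS, recognition rate) pairs in place of the placeholders, `predict` would estimate the recognition rate to expect at a given PESQ MOS. How well such a fit generalizes depends on the noise types and microphone distances covered by the measurements.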
Keywords :
speech recognition; telephony; ITU-T Recommendation P.862; PESQ MOS; Perceptual Evaluation of Speech Quality; achievable speech recognition rates; additive background noises; mean opinion score; measurable speech quality; microphone-sound-source separation; performance; telephony speech; Acoustic noise; Automatic speech recognition; Background noise; Degradation; Delay estimation; Speech analysis; Speech codecs; Speech recognition; Telephony; Working environment noise
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326123