DocumentCode :
417272
Title :
Investigations into the relationship between measurable speech quality and speech recognition rate for telephony speech
Author :
Sun, Hanwu ; Shue, Louis ; Chen, Jianfeng
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
This paper investigates a possible relationship between the performance of a telephony speech recognition system and the objective speech quality assessment method described in ITU-T Recommendation P.862, known as Perceptual Evaluation of Speech Quality (PESQ). Experiments were conducted using various additive background noises and different separations between the microphone and the sound source. The preliminary results suggest that telephony speech recognition rates can be mapped to the mean opinion score (MOS) obtained by PESQ using a relatively simple polynomial relationship. This indicates that the PESQ MOS can act as a reliable predictor of the achievable recognition rates for telephony-based speech recognition systems.
Keywords :
speech recognition; telephony; ITU-T Recommendation P.862; PESQ MOS; Perceptual Evaluation of Speech Quality; achievable speech recognition rates; additive background noises; mean opinion score; measurable speech quality; microphone-sound-source separation; performance; telephony speech; Acoustic noise; Automatic speech recognition; Background noise; Degradation; Delay estimation; Speech analysis; Speech codecs; Speech recognition; Telephony; Working environment noise;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326123
Filename :
1326123