Title :
Speech recognition performance as an effective perceived quality predictor
Author :
Jiang, Wenyu ; Schulzrinne, Henning
Author_Institution :
Dept. of Comput. Sci., Columbia Univ., New York, NY, USA
fDate :
6/24/1905 12:00:00 AM
Abstract :
Determining the perceived quality of packet audio under packet loss usually requires human-based mean opinion score (MOS) listening tests. We propose a new MOS estimation method based on machine speech recognition. Its automated, machine-based nature facilitates real-time monitoring of transmission quality without the need to conduct time-consuming listening tests. Our evaluation of this new method shows that it can use the word recognition ratio metric to reliably predict perceived quality. In particular, we find that although the absolute word recognition ratio of a speech recognizer may vary depending on the speaker, the relative word recognition ratio, obtained by dividing the absolute word recognition ratio with its own value at 0% loss, is speaker-independent. Therefore the relative word recognition ratio is well suited as a universal, speaker-independent MOS predictor. Finally we have also conducted human-based word recognition tests and examined its relationship with machine-based recognition results. Our analysis shows that they are correlated although not very linearly. Also we find that human-based word recognition ratio does not degrade significantly once packet loss is large (≥10%).
Keywords :
Internet telephony; correlation methods; speech intelligibility; speech recognition; Internet telephony; MOS estimation method; VoIP; absolute word recognition ratio; correlation; human-based MOS listening tests; human-based word recognition tests; machine speech recognition; mean opinion score listening tests; packet audio; packet loss; perceived quality predictor; real-time transmission quality monitoring; relative word recognition ratio; speaker-independent MOS predictor; speech recognition performance; voice over IP; Automatic testing; Computer science; Computerized monitoring; Condition monitoring; Degradation; Equations; Internet; Postal services; Speech analysis; Speech recognition;
Conference_Titel :
Quality of Service, 2002. Tenth IEEE International Workshop on
Print_ISBN :
0-7803-7426-6
DOI :
10.1109/IWQoS.2002.1006595