• DocumentCode
    417272
  • Title

    Investigations into the relationship between measurable speech quality and speech recognition rate for telephony speech

  • Author

    Sun, Hanwu ; Shue, Louis ; Chen, Jianfeng

  • Author_Institution
    Inst. for Infocomm Res., Singapore, Singapore
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    In this paper, an investigation to establish a possible relationship between the performance of a telephony speech recognition system and the method for objective speech quality assessment described in ITU-T Recommendation P.862, known as Perceptual Evaluation of Speech Quality (PESQ), is presented. Experiments using various additive background noises, and at different separations between the microphone and the sound-source have been conducted to establish such a relationship. The preliminary results suggest that telephony speech recognition rates can be mapped to the mean opinion score (MOS) obtained by PESQ using a relatively simple polynomial relationship. This indicates that the PESQ MOS can act as a reliable predictor for the achievable speech recognition rates for telephony-based speech recognition systems.
  • Keywords
    speech recognition; telephony; ITU-T Recommendation P.862; PESQ MOS; Perceptual Evaluation of Speech Quality; achievable speech recognition rates; additive background noises; mean opinion score; measurable speech quality; microphone-sound-source separation; performance; telephony speech; Acoustic noise; Automatic speech recognition; Background noise; Degradation; Delay estimation; Speech analysis; Speech codecs; Speech recognition; Telephony; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326123
  • Filename
    1326123