  • DocumentCode
    294524
  • Title
    Reducing word error rate on conversational speech from the Switchboard corpus
  • Author
    Jeanrenaud, P. ; Eide, E. ; Chaudhari, U. ; McDonough, J. ; Ng, K. ; Siu, M. ; Gish, H.
  • Author_Institution
    BBN Syst. & Technol. Corp., Cambridge, MA, USA
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    53
  • Abstract
    Speech recognition of conversational speech is a difficult task. Performance levels on the Switchboard corpus had been in the vicinity of 70% word error rate. In this paper, we describe the results of applying a variety of modifications to our speech recognition system and show their impact on performance on conversational speech. These modifications include the use of more complex models, trigram language models, and cross-word triphone models. We also show the effect of using additional acoustic training on recognition performance. Finally, we present an approach to dealing with the abundance of short words, and examine how the variable speaking rate found in conversational speech affects performance. Currently, the level of performance is in the vicinity of 50% error, a significant improvement over recent levels.
  • Keywords
    error statistics; speech recognition; Switchboard corpus; acoustic training; complex models; conversational speech; cross-word triphone models; performance levels; reducing word error rate; short words; trigram language models; variable speaking rate; Air pollution; Error analysis; Performance analysis; Positron emission tomography; Telephony; Testing; Vocabulary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type
    conf
  • DOI
    10.1109/ICASSP.1995.479271
  • Filename
    479271