• DocumentCode
    2980380
  • Title

    A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER)

  • Author

    Fiscus, Jonathan G.

  • Author_Institution
    Nat. Inst. of Stand. & Technol., Gaithersburg, MD, USA
  • fYear
    1997
  • fDate
    14-17 Dec 1997
  • Firstpage
    347
  • Lastpage
    354
  • Abstract
    Describes a system developed at NIST to produce a composite automatic speech recognition (ASR) system output when the outputs of multiple ASR systems are available, and for which, in many cases, the composite ASR output has a lower error rate than any of the individual systems. The system implements a “voting” or rescoring process to reconcile differences in ASR system outputs. We refer to this system as the NIST Recognizer Output Voting Error Reduction (ROVER) system. As additional knowledge sources are added to an ASR system (e.g. acoustic and language models), error rates are typically decreased. This paper describes a post-recognition process which models the output generated by multiple ASR systems as independent knowledge sources that can be combined and used to generate an output with reduced error rate. To accomplish this, the outputs of multiple of ASR systems are combined into a single, minimal-cost word transition network (WTN) via iterative applications of dynamic programming (DP) alignments. The resulting network is searched by an automatic rescoring or “voting” process that selects the output sequence with the lowest score
  • Keywords
    dynamic programming; error handling; fault tolerant computing; iterative methods; speech processing; speech recognition; National Institute of Standards and Technology; ROVER; acoustic model; automatic speech recognition; composite system output; dynamic programming alignments; independent knowledge sources; iterative applications; language model; minimal-cost word transition network; output modelling; output sequence selection; post-processing system; recognizer output voting error reduction; reduced word error rates; rescoring process; Automatic speech recognition; Benchmark testing; Conferences; Costs; Dynamic programming; Error analysis; NIST; Statistical analysis; System testing; Voting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
  • Conference_Location
    Santa Barbara, CA
  • Print_ISBN
    0-7803-3698-4
  • Type

    conf

  • DOI
    10.1109/ASRU.1997.659110
  • Filename
    659110