• DocumentCode
    1403239
  • Title

    On the Relationship Between Bayes Risk and Word Error Rate in ASR

  • Author

    Schlüter, Ralf ; Nussbaum-Thom, Markus ; Ney, Hermann

  • Author_Institution
    Lehrstuhl fur Inf. 6-Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
  • Volume
    19
  • Issue
    5
  • fYear
    2011
  • fDate
    7/1/2011 12:00:00 AM
  • Firstpage
    1103
  • Lastpage
    1112
  • Abstract
    Recently, a number of (approximate) approaches emerged in speech processing, which try to overcome the known lack of match between symbol level evaluation measures (e.g., word error rate) and the standard string (symbol sequence) cost (e.g., sentence error)-based Bayes decision rule, by using symbol level cost functions for Bayes decision rule. Nevertheless, experiments show that for a majority of test samples both decision rules still give equal decisions, especially at lower error rates. In this paper, analytic evidence for these observations is provided. A set of conditions is presented, for which Bayes decision rule with symbol level and string level cost function leads to the same decisions. Furthermore, the case of word error cost represented by the Levenshtein (edit) distance is investigated, which upon others covers the important case of speech recognition. A Hamming distance-based upper bound to the Levenshtein cost function is discussed. This cost function relates to former, word-posterior based decision rules, and the corresponding efficient decision rule is shown to be strongly related to Bayes decision rule with the Levenshtein cost. The analytic results are verified experimentally, and their quantitative effect is studied by experiments on four different well-known large vocabulary automatic speech recognition tasks.
  • Keywords
    Bayes methods; decision theory; speech processing; speech recognition; ASR; Bayes risk; Hamming distance-based upper bound; Levenshtein cost function; Levenshtein distance; edit distance; sentence error-based Bayes decision rule; speech processing; standard string cost; string level cost function; symbol level cost functions; symbol level evaluation measures; symbol sequence cost; vocabulary automatic speech recognition tasks; word error cost; word error rate; word-posterior based decision rules; Approximation methods; Cost function; Error analysis; Measurement; Speech; Speech processing; Speech recognition; Bayes decision rule; Bayes risk; Hamming cost; Levenshtein cost; cost function; edit distance;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2091635
  • Filename
    5667045