DocumentCode
1403239
Title
On the Relationship Between Bayes Risk and Word Error Rate in ASR
Author
Schlüter, Ralf ; Nussbaum-Thom, Markus ; Ney, Hermann
Author_Institution
Lehrstuhl fur Inf. 6-Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
Volume
19
Issue
5
fYear
2011
fDate
7/1/2011 12:00:00 AM
Firstpage
1103
Lastpage
1112
Abstract
Recently, a number of (approximate) approaches emerged in speech processing, which try to overcome the known lack of match between symbol level evaluation measures (e.g., word error rate) and the standard string (symbol sequence) cost (e.g., sentence error)-based Bayes decision rule, by using symbol level cost functions for Bayes decision rule. Nevertheless, experiments show that for a majority of test samples both decision rules still give equal decisions, especially at lower error rates. In this paper, analytic evidence for these observations is provided. A set of conditions is presented, for which Bayes decision rule with symbol level and string level cost function leads to the same decisions. Furthermore, the case of word error cost represented by the Levenshtein (edit) distance is investigated, which upon others covers the important case of speech recognition. A Hamming distance-based upper bound to the Levenshtein cost function is discussed. This cost function relates to former, word-posterior based decision rules, and the corresponding efficient decision rule is shown to be strongly related to Bayes decision rule with the Levenshtein cost. The analytic results are verified experimentally, and their quantitative effect is studied by experiments on four different well-known large vocabulary automatic speech recognition tasks.
Keywords
Bayes methods; decision theory; speech processing; speech recognition; ASR; Bayes risk; Hamming distance-based upper bound; Levenshtein cost function; Levenshtein distance; edit distance; sentence error-based Bayes decision rule; speech processing; standard string cost; string level cost function; symbol level cost functions; symbol level evaluation measures; symbol sequence cost; vocabulary automatic speech recognition tasks; word error cost; word error rate; word-posterior based decision rules; Approximation methods; Cost function; Error analysis; Measurement; Speech; Speech processing; Speech recognition; Bayes decision rule; Bayes risk; Hamming cost; Levenshtein cost; cost function; edit distance;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2010.2091635
Filename
5667045
Link To Document