DocumentCode
2769299
Title
Automatic speech recognition based on weighted minimum classification error (W-MCE) training method
Author
Fu, Qiang ; Juang, Biing-Hwang
Author_Institution
Georgia Inst. of Technol., Atlanta
fYear
2007
fDate
9-13 Dec. 2007
Firstpage
278
Lastpage
283
Abstract
The Bayes decision theory is the foundation of the classical statistical pattern recognition approach. For most of pattern recognition problems, the Bayes decision theory is employed assuming that the system performance metric is defined as the simple error counting, which assigns identical cost to each recognition error. However, this prevalent performance metric is not desirable in many practical applications. For example, the cost of "recognition" error is required to be differentiated in keyword spotting systems. In this paper, we propose an extended framework for the speech recognition problem with non-uniform classification/recognition error cost. As the system performance metric, the recognition error is weighted based on the task objective. The Bayes decision theory is employed according to this performance metric and the decision rule with a non-uniform error cost function is derived. We argue that the minimum classification error (MCE) method, after appropriate generalization, is the most suitable training algorithm for the "optimal" classifier design to minimize the weighted error rate. We formulate the weighted MCE (W-MCE) algorithm based on the conventional MCE infrastructure by integrating the error cost and the recognition error count into one objective function. In the context of automatic speech recognition (ASR), we present a variety of training scenarios and weighting strategies under this extended framework. The experimental demonstration for large vocabulary continuous speech recognition is provided to support the effectiveness of our approach.
Keywords
Bayes methods; decision theory; error statistics; learning (artificial intelligence); minimisation; pattern classification; speech recognition; Bayes decision theory; automatic speech recognition; classical statistical pattern recognition; keyword spotting system; large vocabulary continuous speech recognition; nonuniform error cost function; objective function; weighted minimum classification error training method; Algorithm design and analysis; Automatic speech recognition; Cost function; Decision theory; Error analysis; Measurement; Pattern recognition; Speech recognition; System performance; Vocabulary; non-uniform error cost; weighted MCE;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location
Kyoto
Print_ISBN
978-1-4244-1746-9
Electronic_ISBN
978-1-4244-1746-9
Type
conf
DOI
10.1109/ASRU.2007.4430124
Filename
4430124
Link To Document