Automatic speech recognition based on weighted minimum classification error (W-MCE) training method

Author

Fu, Qiang ; Juang, Biing-Hwang

Author_Institution

Georgia Inst. of Technol., Atlanta

fYear

2007

fDate

9-13 Dec. 2007

Firstpage

278

Lastpage

283

Abstract

The Bayes decision theory is the foundation of the classical statistical pattern recognition approach. For most of pattern recognition problems, the Bayes decision theory is employed assuming that the system performance metric is defined as the simple error counting, which assigns identical cost to each recognition error. However, this prevalent performance metric is not desirable in many practical applications. For example, the cost of "recognition" error is required to be differentiated in keyword spotting systems. In this paper, we propose an extended framework for the speech recognition problem with non-uniform classification/recognition error cost. As the system performance metric, the recognition error is weighted based on the task objective. The Bayes decision theory is employed according to this performance metric and the decision rule with a non-uniform error cost function is derived. We argue that the minimum classification error (MCE) method, after appropriate generalization, is the most suitable training algorithm for the "optimal" classifier design to minimize the weighted error rate. We formulate the weighted MCE (W-MCE) algorithm based on the conventional MCE infrastructure by integrating the error cost and the recognition error count into one objective function. In the context of automatic speech recognition (ASR), we present a variety of training scenarios and weighting strategies under this extended framework. The experimental demonstration for large vocabulary continuous speech recognition is provided to support the effectiveness of our approach.

Keywords

Bayes methods; decision theory; error statistics; learning (artificial intelligence); minimisation; pattern classification; speech recognition; Bayes decision theory; automatic speech recognition; classical statistical pattern recognition; keyword spotting system; large vocabulary continuous speech recognition; nonuniform error cost function; objective function; weighted minimum classification error training method; Algorithm design and analysis; Automatic speech recognition; Cost function; Decision theory; Error analysis; Measurement; Pattern recognition; Speech recognition; System performance; Vocabulary; non-uniform error cost; weighted MCE;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on

Conference_Location

Kyoto

Print_ISBN

978-1-4244-1746-9

Electronic_ISBN

978-1-4244-1746-9

Type

conf

DOI

10.1109/ASRU.2007.4430124

Filename

4430124