DocumentCode :
2839202
Title :
Discriminative transform for confidence estimation in Mandarin speech recognition
Author :
Guo, Gang ; Wang, Ren-Hua
fYear :
2004
fDate :
15-18 Dec. 2004
Firstpage :
269
Lastpage :
272
Abstract :
In automatic speech recognition (ASR) applications, log likelihood ratio testing (LRT) is one of the most popular techniques to obtain a confidence measure (CM). Unlike the traditional (log likelihood ratio) LLR related method, we apply nonlinear transformations towards LLR before computing string-level CM. Different phonemes may have different transformation functions. Through suitable LLR transformations, the verification performance of those string-level CM may increase. Transformation functions are implemented by a multilayer perceptron (MLP). Two algorithms are used to optimize the parameters of the MLP: one is the minimum verification error (MVE) algorithm; another is the figure-of-merit (FOM) training algorithm. In our Mandarin command recognition system, the two methods remarkably improve the performance of confidence measures for out-of-vocabulary word rejection compared with the performance of standard LRT related CM, and we obtain a best 45.5% relative reduction in equal error rate (EER). In addition, in our Mandarin command recognition experiments, the FOM training algorithm outperforms the MVE algorithm even they share an approximately same best performance, while due to limited experimental setups in our experiments, which algorithm is the better still needs to be explored.
Keywords :
backpropagation; error statistics; minimisation; multilayer perceptrons; speech processing; speech recognition; vocabulary; ASR applications; FOM training algorithm; LLR transformations; MLP; MVE; Mandarin command recognition; Mandarin speech recognition; automatic speech recognition; confidence estimation; confidence measure; discriminative transform; equal error rate; figure-of-merit training algorithm; log likelihood ratio testing; minimum verification error; multilayer perceptron; nonlinear transformations; out-of-vocabulary word rejection; parameter optimization; phonemes; string-level CM; verification performance; Automatic speech recognition; Automatic testing; Collision mitigation; Electronic equipment testing; Error analysis; Light rail systems; Measurement standards; Performance evaluation; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN :
0-7803-8678-7
Type :
conf
DOI :
10.1109/CHINSL.2004.1409638
Filename :
1409638
Link To Document :
بازگشت