• DocumentCode
    2839202
  • Title

    Discriminative transform for confidence estimation in Mandarin speech recognition

  • Author

    Guo, Gang ; Wang, Ren-Hua

  • fYear
    2004
  • fDate
    15-18 Dec. 2004
  • Firstpage
    269
  • Lastpage
    272
  • Abstract
    In automatic speech recognition (ASR) applications, log likelihood ratio testing (LRT) is one of the most popular techniques to obtain a confidence measure (CM). Unlike the traditional (log likelihood ratio) LLR related method, we apply nonlinear transformations towards LLR before computing string-level CM. Different phonemes may have different transformation functions. Through suitable LLR transformations, the verification performance of those string-level CM may increase. Transformation functions are implemented by a multilayer perceptron (MLP). Two algorithms are used to optimize the parameters of the MLP: one is the minimum verification error (MVE) algorithm; another is the figure-of-merit (FOM) training algorithm. In our Mandarin command recognition system, the two methods remarkably improve the performance of confidence measures for out-of-vocabulary word rejection compared with the performance of standard LRT related CM, and we obtain a best 45.5% relative reduction in equal error rate (EER). In addition, in our Mandarin command recognition experiments, the FOM training algorithm outperforms the MVE algorithm even they share an approximately same best performance, while due to limited experimental setups in our experiments, which algorithm is the better still needs to be explored.
  • Keywords
    backpropagation; error statistics; minimisation; multilayer perceptrons; speech processing; speech recognition; vocabulary; ASR applications; FOM training algorithm; LLR transformations; MLP; MVE; Mandarin command recognition; Mandarin speech recognition; automatic speech recognition; confidence estimation; confidence measure; discriminative transform; equal error rate; figure-of-merit training algorithm; log likelihood ratio testing; minimum verification error; multilayer perceptron; nonlinear transformations; out-of-vocabulary word rejection; parameter optimization; phonemes; string-level CM; verification performance; Automatic speech recognition; Automatic testing; Collision mitigation; Electronic equipment testing; Error analysis; Light rail systems; Measurement standards; Performance evaluation; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing, 2004 International Symposium on
  • Print_ISBN
    0-7803-8678-7
  • Type

    conf

  • DOI
    10.1109/CHINSL.2004.1409638
  • Filename
    1409638