• DocumentCode
    2279548
  • Title

    Verification of multi-class recognition decision using classification approach

  • Author

    Matsui, Tomoko ; Soong, Frank K. ; Juang, Biing-hwang

  • Author_Institution
    ATR Spoken Language Translation Res. Labs., Kyoto, Japan
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    123
  • Lastpage
    126
  • Abstract
    We investigate various strategies to improve the utterance verification performance using a 2-class pattern classifier. They include utilizing N-best candidate scores, modifying segmentation boundaries, applying background and out-of-vocabulary filler models, incorporating contexts, and minimizing verification errors via discriminative training. A connected-digit database containing utterances recorded in a noisy, moving car with a hands-free microphone mounted on a sun-visor is used to evaluate the verification performance. The equal error rate (EER) of word verification is employed as the performance measure in our evaluations. All factors considered in our study and their effects on the verification performance are presented in detail. The EER is reduced from 29%, using the standard likelihood ratio test, down to 21.4%, when all enhancements are integrated together.
  • Keywords
    decision theory; error statistics; microphones; pattern classification; speech enhancement; speech recognition; vocabulary; 2-class pattern classifier; N-best candidate scores; background filler model; classification; connected-digit database; contexts; discriminative training; equal error rate; hands-free microphone; multi-class recognition decision; noisy moving car; out-of-vocabulary filler model; segmentation boundaries; speech enhancement; utterance verification performance; word verification; Automatic speech recognition; Context modeling; Databases; Degradation; Man machine systems; Microphones; Natural languages; Performance evaluation; Testing; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
  • Print_ISBN
    0-7803-7343-X
  • Type

    conf

  • DOI
    10.1109/ASRU.2001.1034603
  • Filename
    1034603