• DocumentCode
    1425077
  • Title

    Discriminative utterance verification for connected digits recognition

  • Author

    Rahim, Mazin G. ; Lee, Chin-Hui ; Juang, Biing-hwang

  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    5
  • Issue
    3
  • fYear
    1997
  • fDate
    5/1/1997 12:00:00 AM
  • Firstpage
    266
  • Lastpage
    277
  • Abstract
    Utterance verification represents an important technology in the design of user-friendly speech recognition systems. It involves the recognition of keyword strings and the rejection of nonkeyword strings. This paper describes a hidden Markov model-based (HMM-based) utterance verification system using the framework of statistical hypothesis testing. The two major issues on how to design keyword and string scoring criteria are addressed. For keyword verification, different alternative hypotheses are proposed based on the scores of antikeyword models and a general acoustic filler model. For string verification, different measures are proposed with the objective of detecting nonvocabulary word strings and possibly erroneous strings (so-called putative errors). This paper also motivates the need for discriminative hypothesis testing in verification. One such approach based on minimum classification error training is investigated in detail. When the proposed verification technique was integrated into a state-of-the-art connected digit recognition system, the string error rate for valid digit strings was found to decrease by 57% when setting the rejection rate to 5%. Furthermore, the system was able to correctly reject over 99.9% of nonvocabulary word strings
  • Keywords
    error statistics; hidden Markov models; speech processing; speech recognition; statistical analysis; HMM utterance verification system; antikeyword models; connected digits recognition; discriminative hypothesis testing; discriminative utterance verification; erroneous strings; general acoustic filler model; hidden Markov model; keyword strings recognition; keyword verification; minimum classification error training; nonkeyword strings rejection; nonvocabulary word strings; putative errors; rejection rate; statistical hypothesis testing; string error rate; string scoring criteria; user-friendly speech recognition systems; Acoustic measurements; Acoustic signal detection; Acoustic testing; Error analysis; Event detection; Hidden Markov models; Speech recognition; System testing; Telephony; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.568733
  • Filename
    568733