DocumentCode :
1425077
Title :
Discriminative utterance verification for connected digits recognition
Author :
Rahim, Mazin G. ; Lee, Chin-Hui ; Juang, Biing-hwang
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Volume :
5
Issue :
3
fYear :
1997
fDate :
5/1/1997 12:00:00 AM
Firstpage :
266
Lastpage :
277
Abstract :
Utterance verification represents an important technology in the design of user-friendly speech recognition systems. It involves the recognition of keyword strings and the rejection of nonkeyword strings. This paper describes a hidden Markov model-based (HMM-based) utterance verification system using the framework of statistical hypothesis testing. The two major issues on how to design keyword and string scoring criteria are addressed. For keyword verification, different alternative hypotheses are proposed based on the scores of antikeyword models and a general acoustic filler model. For string verification, different measures are proposed with the objective of detecting nonvocabulary word strings and possibly erroneous strings (so-called putative errors). This paper also motivates the need for discriminative hypothesis testing in verification. One such approach based on minimum classification error training is investigated in detail. When the proposed verification technique was integrated into a state-of-the-art connected digit recognition system, the string error rate for valid digit strings was found to decrease by 57% when setting the rejection rate to 5%. Furthermore, the system was able to correctly reject over 99.9% of nonvocabulary word strings
Keywords :
error statistics; hidden Markov models; speech processing; speech recognition; statistical analysis; HMM utterance verification system; antikeyword models; connected digits recognition; discriminative hypothesis testing; discriminative utterance verification; erroneous strings; general acoustic filler model; hidden Markov model; keyword strings recognition; keyword verification; minimum classification error training; nonkeyword strings rejection; nonvocabulary word strings; putative errors; rejection rate; statistical hypothesis testing; string error rate; string scoring criteria; user-friendly speech recognition systems; Acoustic measurements; Acoustic signal detection; Acoustic testing; Error analysis; Event detection; Hidden Markov models; Speech recognition; System testing; Telephony; Vocabulary;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.568733
Filename :
568733
Link To Document :
بازگشت