DocumentCode
2021964
Title
Rejection techniques for digit recognition in telecommunication applications
Author
Villarrubia, Luis ; Acero, Alejandro
Author_Institution
Telefonica I+D, Madrid, Spain
Volume
2
fYear
1993
fDate
27-30 April 1993
Firstpage
455
Abstract
The authors describe a technique for nonkeyword rejection and evaluate it in the context of an audiotex service using the ten Spanish digits. The baseline keyword recognition system is a speaker-independent continuous-density hidden Markov model (HMM) recognizer. The use of an affine transformation to the log-probability of the garbage model, an HMM model trained to account for both nonkeyword speech and nonstationary telephone noises is proposed. The parameters of the transformation for the case of isolated keywords are chosen to minimize a cost function that weighs the keyword error rate, keyword rejection rate, and false acceptance rate according to the a priori probabilities of keyword/nonkeyword and the requirements of the specific application. This technique was also extended to embedded keywords (word spotting). The use of this rejection technique on the audiotext application reduced the total cost function by up to 20% for the isolated-word case and by up to 12% for the word spotting case.<>
Keywords
audio systems; hidden Markov models; learning (artificial intelligence); speech recognition; telephony; Spanish digits; affine transformation; audiotex service; continuous-density hidden Markov model; cost function; digit recognition; embedded keywords; garbage model; isolated keywords; nonkeyword rejection; nonstationary telephone noises; word spotting;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location
Minneapolis, MN, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.1993.319339
Filename
319339
Link To Document