Title :
Inter-digit HMM: connected digit recognition using the Macrophone corpus
Author :
Kao, Yu-Hung ; Netsch, Lorin
Author_Institution :
Texas Instrum. Inc., Dallas, TX, USA
Abstract :
Continuous digit recognition over the telephone channel is a key technology for many telecommunications applications such as voice dialing, automatic banking, and credit card number entry. Speech recognizers usually achieve high performance by modeling the acoustics in hidden Markov models (HMMs) using large numbers of multivariate Gaussian mixtures with assumed diagonal covariance in order to model the variability of different speakers and channel conditions. We present a system that uses single mixture 16 feature Gaussian distribution with an assumed identity covariance to achieve a 1.0% word error and a 5.7% sentence error rate on the Macrophone corpus. We found that inter-digit modeling, discriminant training, and per-utterance adaptation can each contribute about a 30% reduction in the error rate. Using this approach, we can realize a system with relatively low memory requirements
Keywords :
Gaussian distribution; acoustic signal processing; hidden Markov models; speech processing; speech recognition; telecommunication channels; telephony; Gaussian distributions; Macrophone corpus; acoustics modeling; automatic banking; channel conditions; connected digit recognition; credit card number entry; diagonal covariance; discriminant training; error rate reduction; hidden Markov models; identity covariance; interdigit HMM; interdigit modeling; multivariate Gaussian mixtures; per-utterance adaptation; sentence error rate; speech recognizers; telecommunications applications; telephone channel; voice dialing; word error rate; Acoustics; Automatic speech recognition; Banking; Credit cards; Error analysis; Gaussian distribution; Hidden Markov models; Loudspeakers; Speech recognition; Telephony;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.598860