DocumentCode :
290108
Title :
Application of vector quantized hidden Markov modeling to telephone network based connected digit recognition
Author :
Buhrke, E.R. ; Cardin, Regis ; Normandin, Yves ; Rahim, Mazin ; Wilpon, Jay
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Volume :
i
fYear :
1994
fDate :
19-22 Apr 1994
Abstract :
Connected digit speech recognition in the telephone network is becoming increasingly more important as the demand for speech technology becomes widespread. In the past few years, several highly successful techniques for recognizing spoken connected digit strings have been proposed. Although these techniques have been applied to non-telephone based speech [e.g. Texas Instruments database], they have produced high recognition performance. Further, similar levels of performances have been demonstrated using discrete density and continuous density based hidden Markov models (HMMs). The success of the vector quantized (VQ) modeling approach, in particular, is encouraging and rather important from the viewpoint of computational efficiency. This paper presents a study of connected digit recognition on telephone network based data using VQ HMMs. We investigate several factors affecting the error rate of VQ HMMs-such as maximum mutual information (MMI) training, sender modeling, and codebook size-and measure their contributions to recognition accuracy. The model architecture, number of states and transitions, is also optimized and its contribution to overall performance discussed
Keywords :
hidden Markov models; speech coding; speech recognition; telephone networks; telephony; vector quantisation; VQ HMM; codebook size; computational efficiency; connected digit speech recognition; error rate; hidden Markov models; maximum mutual information training; model architecture; performance; recognition accuracy; recognition performance; sender modeling; speech technology; telephone network; vector quantized hidden Markov modeling; Computational efficiency; Computer architecture; Databases; Error analysis; Hidden Markov models; Instruments; Mutual information; Size measurement; Speech recognition; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
ISSN :
1520-6149
Print_ISBN :
0-7803-1775-0
Type :
conf
DOI :
10.1109/ICASSP.1994.389344
Filename :
389344
Link To Document :
بازگشت