DocumentCode :
2574020
Title :
Using Gaussian mixture modeling in speech recognition
Author :
Zhang, Yaxin ; Alder, Mike ; Togneri, Roberto
Author_Institution :
Centre for Intelligent Inf. Process. Syst., Western Australia Univ., Nedlands, WA, Australia
fYear :
1994
fDate :
19-22 Apr 1994
Abstract :
The paper describes a speaker-independent isolated word recognition system which uses a well known technique, the combination of vector quantization with hidden Markov modeling. The conventional vector quantization algorithm is substituted by a statistical clustering algorithm, the expectation-maximization algorithm, in this system. Based on the investigation of the data space, the phonemes were manually extracted from the training data and were used to generate the Gaussians in a code book in which each code word is a Gaussian rather than a centroid vector of the data class. Word-based hidden Markov modeling was then performed. Two English isolated digits data bases were investigated and the 12 Mel-spaced filter bank coefficients employed as the input feature. Compared with the conventional discrete HMM, the present system obtained a significant improvement of recognition accuracy
Keywords :
Gaussian distribution; hidden Markov models; speech coding; speech recognition; vector quantisation; English isolated digits data bases; Gaussian mixture modeling; Mel-spaced filter bank coefficients; code book; expectation-maximization algorithm; hidden Markov modeling; input feature; phonemes; speaker-independent isolated word recognition system; speech recognition; statistical clustering algorithm; training data; vector quantization; Books; Clustering algorithms; Filter bank; Hidden Markov models; Information processing; Intelligent systems; Random number generation; Signal generators; Speech recognition; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
ISSN :
1520-6149
Print_ISBN :
0-7803-1775-0
Type :
conf
DOI :
10.1109/ICASSP.1994.389219
Filename :
389219
Link To Document :
بازگشت