DocumentCode :
2286830
Title :
An improved HMM/VQ training procedure for speaker-independent isolated word recognition
Author :
Zhang, Yaxin ; Alder, Mike
Author_Institution :
Dept. of Electr. & Electron. Eng., Western Australia Univ., Nedlands, WA, Australia
fYear :
1994
fDate :
13-16 Apr 1994
Firstpage :
722
Abstract :
This paper describe an improved training procedure in a HMM/VQ speech recognition system for speaker-independent speech recognition. The phoneme based Gaussian mixture models (GMM) were generated in the first step modeling using the Expectation-Maximization (EM) algorithm. These Gaussians more accurately describe the distribution characteristic of the phonemes in the speech signal space. Therefore better first step modeling is achieved and the performance of the whole recognition system is improved. The new method was used in a speaker-independent isolated digits and phoneme recognition tasks. Two English databases were used for the training and testing. Significant improvements have been achieved in comparison with the conventional HMM/VQ system
Keywords :
hidden Markov models; speech recognition; stochastic processes; vector quantisation; English databases; HMM/VQ training procedure; distribution characteristic; expectation-maximization algorithm; phoneme based Gaussian mixture models; speaker-independent isolated word recognition; speech signal space; Books; Clustering algorithms; Hidden Markov models; Image coding; Image recognition; Signal generators; Signal processing; Signal processing algorithms; Speech recognition; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
Print_ISBN :
0-7803-1865-X
Type :
conf
DOI :
10.1109/SIPNN.1994.344810
Filename :
344810
Link To Document :
بازگشت