Title :
Gaussian mixture selection using context-independent HMM
Author :
Lee, Akinobu ; Kawahara, Tatsuya ; Shikano, Kiyohiro
Author_Institution :
Nara Inst. of Sci. & Technol., Ikoma, Japan
Abstract :
We address a method to efficiently select Gaussian mixtures for fast acoustic likelihood computation. It makes use of context-independent models for selection and back-off of corresponding triphone models. Specifically, for the k-best phone models by the preliminary evaluation, triphone models of higher resolution are applied, and others are assigned likelihoods with the monophone models. This selection scheme assigns more reliable back-off likelihoods to the un-selected states than the conventional Gaussian selection based on a VQ codebook. It can also incorporate efficient Gaussian pruning at the preliminary evaluation, which offsets the increased size of the pre-selection model. Experimental results show that the proposed method achieves comparable performance as the standard Gaussian selection, and performs much better under aggressive pruning condition. Together with the phonetic tied-mixture modeling, acoustic matching cost is reduced to almost 14% with little loss of accuracy
Keywords :
decoding; hidden Markov models; probability; pulse time modulation; search problems; speech recognition; Gaussian mixture selection; Gaussian pruning; acoustic likelihood computation; acoustic matching cost; aggressive pruning; back-off likelihoods; context-independent HMM; context-independent models; phonetic tied-mixture modeling; triphone models; Books; Computational efficiency; Context modeling; Costs; Decoding; Hidden Markov models; Large-scale systems; Real time systems; Speech recognition; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940769