DocumentCode :
284591
Title :
Phonemic HMM constrained by statistical VQ-code transition
Author :
Takahashi, Satoshi ; Matsuoka, Tatsuo ; Shikano, Kiyohiro
Author_Institution :
NTT Human Interface Labs., Tokyo, Japan
Volume :
1
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
553
Abstract :
A hidden Markov modeling technique that uses statistical modeling of vector quantization (VQ)-code transitions is proposed. A bigram-constrained HMM is obtained by combining a VQ-code bigram with the conventional speaker-independent HMM. The proposed model reduces the overlap of feature distributions between different phonemes by restricting the local VQ-code transitions. The output probabilities in the model are conditioned on the VQ code of the previous frame. Therefore, the output probability distribution of the model changes depending on the previous frame, even within the same state. A speaker-dependent bigram-constrained HMM is obtained using a VQ-code bigram calculated from utterances of an input speaker. A speaker-independent bigram-constrained HMM is obtained using a VQ-code bigram calculated from utterances of many speakers. The model was evaluated in an 18-Japanese-consonant recognition experiment using 5240 words. The speaker-independent bigram-constrained HMM achieved an average recognition accuracy of 76.3%, which is 5.5% higher than that of the conventional speaker-independent HMM.
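The abstract says the output probabilities are conditioned on the VQ code of the previous frame, but it does not give the combination rule. The following is a minimal sketch under an assumed rule: each state's discrete output distribution is multiplied by the bigram row for the previous code and then renormalized. The function name `bigram_constrained_outputs`, the toy probabilities, and the fallback for an all-zero product are hypothetical and are not taken from the paper.

```python
def bigram_constrained_outputs(b, bigram):
    """Combine discrete HMM output probabilities with a VQ-code bigram.

    b[s][k]      : P(code k | state s) from a conventional discrete HMM.
    bigram[p][k] : P(code k | previous code p), estimated from VQ-code sequences.
    Returns c[p][s][k], the output distribution of state s given previous code p.

    ASSUMPTION: the product-and-renormalize rule below is illustrative only;
    the abstract does not state how the two sources are combined.
    """
    constrained = []
    for prev_row in bigram:
        per_state = []
        for state_row in b:
            weighted = [bk * pk for bk, pk in zip(state_row, prev_row)]
            total = sum(weighted)
            if total > 0.0:
                per_state.append([w / total for w in weighted])
            else:
                # No code is supported by both sources: fall back to the plain HMM row.
                per_state.append(list(state_row))
        constrained.append(per_state)
    return constrained


# Toy example: 2 states, 3 VQ codes (all values are made up).
b = [
    [0.5, 0.3, 0.2],
    [0.1, 0.2, 0.7],
]
bigram = [
    [0.8, 0.2, 0.0],
    [0.1, 0.6, 0.3],
    [0.0, 0.3, 0.7],
]
c = bigram_constrained_outputs(b, bigram)
```

In this sketch, `c[0][0]` and `c[1][0]` are different distributions for the same state, which mirrors the abstract's point that the output distribution changes with the previous frame. Passing a bigram estimated from one speaker or from many speakers would correspond to the speaker-dependent and speaker-independent variants, respectively.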
Keywords :
hidden Markov models; speech coding; speech recognition; vector quantisation; Japanese consonant recognition; VQ-code bigram; VQ-code transition; average recognition accuracy; bigram-constrained HMM; feature distributions; hidden Markov modeling; output probabilities; phonemic HMM; speaker-independent HMM; statistical modeling; vector quantization; Degradation; Hidden Markov models; Humans; Laboratories; Parameter estimation; Probability distribution; Speech; Training data; Vector quantization;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.225848
Filename :
225848