DocumentCode :
1907440
Title :
Robust HMM phoneme modeling for different speaking styles
Author :
Matsuoka, Tatsuo ; Shikano, Kiyohiro
Author_Institution :
NTT Human Interface Lab., Tokyo, Japan
fYear :
1991
fDate :
14-17 Apr 1991
Firstpage :
265
Abstract :
The authors describe the robustness of six types of phoneme-based HMMs (hidden Markov models) against speaking-style variations. The six types of models are VQ (vector quantization)-based and fuzzy VQ-based discrete HMMs, and single-Gaussian and mixture-Gaussian HMMs with either diagonal or full covariance matrices. The mixture-Gaussian HMM with diagonal covariance matrices, the fuzzy VQ-based discrete HMM, and the single-Gaussian HMM with full covariance matrices show better results than the other three in 18-Japanese-consonant recognition experiments. The authors also propose a model-adaptation technique that combines multiple models using the deleted interpolation. This technique makes models easy to apply to different-speaking-style speech
Keywords :
Markov processes; speech recognition; 18-Japanese-consonant recognition; deleted interpolation; diagonal covariance matrices; full covariance matrices; fuzzy VQ-based discrete HMM; hidden Markov models; mixture-Gaussian HMM; model-adaptation technique; phoneme modeling; robustness; single-Gaussian HMM; speaking-style variations; speech recognition; vector quantization; Covariance matrix; Databases; Hidden Markov models; Humans; Interpolation; Noise robustness; Speech analysis; Speech enhancement; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
ISSN :
1520-6149
Print_ISBN :
0-7803-0003-3
Type :
conf
DOI :
10.1109/ICASSP.1991.150328
Filename :
150328
Link To Document :
بازگشت