DocumentCode :
3520568
Title :
Large vocabulary word recognition based on demi-syllable hidden Markov model using small amount of training data
Author :
Yoshida, Kenta ; Watanabe, Toshio
Author_Institution :
NEC Corp., Kawasaki
fYear :
1989
fDate :
23-26 May 1989
Firstpage :
1
Abstract :
The authors present a large-vocabulary speech recognition method based on hidden Markov models (HMMs) and aimed at high recognition performance with a small amount of training data. The recognition model is designed to treat contextual and allophonic variations utilizing acoustic-phonetic knowledge. The demisyllable is used as a recognition unit to treat contextual variations caused by the coarticulation effect. A single Gaussian probability density function is used as the HMM output probability, and allophonic units are defined to deal with greater allophonic variations, such as vowel devoicing. In an experiment, demisyllable models were trained using a 250 training word set, and 99.0% and 97.5% recognition rates were obtained for 500-word and 1800-word vocabularies, respectively. The result demonstrates the effectiveness of the method
Keywords :
Markov processes; speech recognition; Gaussian probability density function; HMM output probability; acoustic-phonetic knowledge; allophonic variations; coarticulation effect; contextual variations; demisyllable; hidden Markov model; large-vocabulary speech recognition; speaker dependent; training data; vowel devoicing; word recognition; Context modeling; Dictionaries; Hidden Markov models; Information technology; Laboratories; National electric code; Probability density function; Speech recognition; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location :
Glasgow
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.1989.266348
Filename :
266348
Link To Document :
بازگشت