Title :
Large vocabulary word recognition based on demi-syllable hidden Markov model using small amount of training data
Author :
Yoshida, Kenta ; Watanabe, Toshio
Author_Institution :
NEC Corp., Kawasaki
Abstract :
The authors present a large-vocabulary speech recognition method based on hidden Markov models (HMMs) and aimed at high recognition performance with a small amount of training data. The recognition model is designed to treat contextual and allophonic variations utilizing acoustic-phonetic knowledge. The demisyllable is used as a recognition unit to treat contextual variations caused by the coarticulation effect. A single Gaussian probability density function is used as the HMM output probability, and allophonic units are defined to deal with greater allophonic variations, such as vowel devoicing. In an experiment, demisyllable models were trained using a 250 training word set, and 99.0% and 97.5% recognition rates were obtained for 500-word and 1800-word vocabularies, respectively. The result demonstrates the effectiveness of the method
Keywords :
Markov processes; speech recognition; Gaussian probability density function; HMM output probability; acoustic-phonetic knowledge; allophonic variations; coarticulation effect; contextual variations; demisyllable; hidden Markov model; large-vocabulary speech recognition; speaker dependent; training data; vowel devoicing; word recognition; Context modeling; Dictionaries; Hidden Markov models; Information technology; Laboratories; National electric code; Probability density function; Speech recognition; Training data; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location :
Glasgow
DOI :
10.1109/ICASSP.1989.266348