DocumentCode
3520568
Title
Large vocabulary word recognition based on demi-syllable hidden Markov model using small amount of training data
Author
Yoshida, Kenta ; Watanabe, Toshio
Author_Institution
NEC Corp., Kawasaki
fYear
1989
fDate
23-26 May 1989
Firstpage
1
Abstract
The authors present a large-vocabulary speech recognition method based on hidden Markov models (HMMs) and aimed at high recognition performance with a small amount of training data. The recognition model is designed to treat contextual and allophonic variations utilizing acoustic-phonetic knowledge. The demisyllable is used as a recognition unit to treat contextual variations caused by the coarticulation effect. A single Gaussian probability density function is used as the HMM output probability, and allophonic units are defined to deal with greater allophonic variations, such as vowel devoicing. In an experiment, demisyllable models were trained using a 250 training word set, and 99.0% and 97.5% recognition rates were obtained for 500-word and 1800-word vocabularies, respectively. The result demonstrates the effectiveness of the method
Keywords
Markov processes; speech recognition; Gaussian probability density function; HMM output probability; acoustic-phonetic knowledge; allophonic variations; coarticulation effect; contextual variations; demisyllable; hidden Markov model; large-vocabulary speech recognition; speaker dependent; training data; vowel devoicing; word recognition; Context modeling; Dictionaries; Hidden Markov models; Information technology; Laboratories; National electric code; Probability density function; Speech recognition; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location
Glasgow
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.1989.266348
Filename
266348
Link To Document