• DocumentCode
    3520568
  • Title

    Large vocabulary word recognition based on demi-syllable hidden Markov model using small amount of training data

  • Author

    Yoshida, Kenta ; Watanabe, Toshio

  • Author_Institution
    NEC Corp., Kawasaki
  • fYear
    1989
  • fDate
    23-26 May 1989
  • Firstpage
    1
  • Abstract
    The authors present a large-vocabulary speech recognition method based on hidden Markov models (HMMs) and aimed at high recognition performance with a small amount of training data. The recognition model is designed to treat contextual and allophonic variations utilizing acoustic-phonetic knowledge. The demisyllable is used as a recognition unit to treat contextual variations caused by the coarticulation effect. A single Gaussian probability density function is used as the HMM output probability, and allophonic units are defined to deal with greater allophonic variations, such as vowel devoicing. In an experiment, demisyllable models were trained using a 250 training word set, and 99.0% and 97.5% recognition rates were obtained for 500-word and 1800-word vocabularies, respectively. The result demonstrates the effectiveness of the method
  • Keywords
    Markov processes; speech recognition; Gaussian probability density function; HMM output probability; acoustic-phonetic knowledge; allophonic variations; coarticulation effect; contextual variations; demisyllable; hidden Markov model; large-vocabulary speech recognition; speaker dependent; training data; vowel devoicing; word recognition; Context modeling; Dictionaries; Hidden Markov models; Information technology; Laboratories; National electric code; Probability density function; Speech recognition; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
  • Conference_Location
    Glasgow
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1989.266348
  • Filename
    266348