• DocumentCode
    284582
  • Title

    Continuous speech recognition with modified learning vector quantization algorithm and two-level DP-matching

  • Author

    Makino, Shozo ; Endo, Mitsuru ; Sone, Toshio ; Kido, Ken´iti

  • Author_Institution
    Res. Center for Appl. Inf. Sci., Tohoku Univ., Sendai, Japan
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    597
  • Abstract
    The authors propose a phoneme recognition method based on the learning vector quantization (LVQ2) algorithm. They propose three kinds of modified training algorithms for the LVQ2 algorithm. In the recognition stage, the likelihood matrix is computed using the reference vectors and then the optimum phoneme sequence is computed from the matrix using the two-level dynamic programming (DP)-matching with duration constraints. The recognition score of phonemes in isolated spoken words was 89.1% for the test set. The phoneme recognition scores obtained by the modified LVQ2 algorithm were higher than those obtained by the original LVQ2 algorithm. The authors applied this method to a multi-speaker-dependent phoneme recognition task for continuous speech uttered Bunsetsu-by-Bunsetsu. The phoneme recognition score was 85.5% for the test speech samples in continuous speech
  • Keywords
    dynamic programming; speech recognition; vector quantisation; LVQ2 algorithm; continuous speech recognition; duration constraints; isolated spoken words; learning vector quantization; likelihood matrix; modified LVQ2 algorithm; optimum phoneme sequence; phoneme recognition method; phoneme recognition scores; recognition score; reference vectors; test speech samples; two-level DP-matching; two-level dynamic programming; Clustering algorithms; Clustering methods; Hidden Markov models; Neural networks; Speech coding; Speech recognition; Testing; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225838
  • Filename
    225838