• DocumentCode
    284608
  • Title

    Segmental GPD training of HMM based speech recognizer

  • Author

    Chou, W. ; Juang, B.H. ; Lee, C.H.

  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    473
  • Abstract
    A novel training algorithm, segmental GPD (generalized probabilistic descent) training, for a hidden Markov model (HMM)-based speech recognizer using Viterbi decoding is proposed. This algorithm is based on the principle of minimum recognition error rate in which segmentation and discriminative training are jointly optimized. Various issues related to the special structure of HMM in segmental GPD training are studied. The authors tested this algorithm on two speaker-independent recognition tasks. The first experiment involves English E-set. Segmental GPD training was directly applied to HMM generated from nonoptimal uniform segmentation. A recognition rate of 88.7% was achieved on English E-set with whole word HMM. The second experiment involves the connected digits TI-database. Segmental GPD training was applied to HMM which were already trained using conventional training methods. A string recognition rate of 98.8% was achieved on 10-state word based HMM through segmental GPD training
  • Keywords
    decoding; hidden Markov models; learning (artificial intelligence); speech recognition; 10-state word based HMM; English E-set; HMM based speech recognizer; Viterbi decoding; connected digits TI-database; discriminative training; generalized probabilistic descent; hidden Markov model; minimum recognition error rate; nonoptimal uniform segmentation; recognition rate; segmental GPD training; segmentation; speaker-independent recognition tasks; special HMM structure; string recognition rate; training algorithm; Availability; DC generators; Decoding; Error analysis; Error probability; Hidden Markov models; Speech recognition; Testing; Training data; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225869
  • Filename
    225869