• DocumentCode
    2998325
  • Title

    A procedure to generate training sequences for a connected word recognizer using the segmental k-means training algorithm

  • Author

    Mikkilineni, R.P. ; Wilpon, J.G. ; Rabiner, L.R.

  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • fYear
    1988
  • fDate
    11-14 Apr 1988
  • Firstpage
    433
  • Abstract
    Past research has shown that a connected digit recognition system, based on either word templates or word hidden Markov models (HMM), could effectively be trained using a segmental k-means training procedure. In these studies, a set of randomly generated digit strings of variable length was used to train the recognizer. However, problems were encountered when this training procedure was extended to systems with medium to large vocabularies. For a training set to be effective, it should represent each vocabulary word and its acoustic variability within the context of all valid input strings defined by a task dependent grammar. A procedure to generate a training set of sentences with these properties is proposed. Using this procedure, a training set of sentences was generated for a connected word recognition system simulating an airline flight reservation task. Several speaker dependent automatic speech recognition (ASR) experiments were performed to assess the effectiveness of the training set generated using the new procedure. The results of these experiments showed that the string accuracy was about 98% when tested on independent sets of test sentences for the three talkers
  • Keywords
    Markov processes; speech recognition; airline flight reservation; connected digit recognition system; connected word recognizer; segmental k-means training algorithm; speaker dependent automatic speech recognition; string accuracy; training sequences generation; word hidden Markov models; word templates; Aerospace simulation; Automatic speech recognition; Hidden Markov models; Laboratories; Pattern recognition; Rain; Speech recognition; Testing; Text recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1988.196611
  • Filename
    196611