A procedure to generate training sequences for a connected word recognizer using the segmental k-means training algorithm

Author

Mikkilineni, R.P. ; Wilpon, J.G. ; Rabiner, L.R.

Author_Institution

AT&T Bell Labs., Murray Hill, NJ, USA

fYear

1988

fDate

11-14 Apr 1988

Firstpage

433

Abstract

Past research has shown that a connected digit recognition system, based on either word templates or word hidden Markov models (HMM), could effectively be trained using a segmental k-means training procedure. In these studies, a set of randomly generated digit strings of variable length was used to train the recognizer. However, problems were encountered when this training procedure was extended to systems with medium to large vocabularies. For a training set to be effective, it should represent each vocabulary word and its acoustic variability within the context of all valid input strings defined by a task dependent grammar. A procedure to generate a training set of sentences with these properties is proposed. Using this procedure, a training set of sentences was generated for a connected word recognition system simulating an airline flight reservation task. Several speaker dependent automatic speech recognition (ASR) experiments were performed to assess the effectiveness of the training set generated using the new procedure. The results of these experiments showed that the string accuracy was about 98% when tested on independent sets of test sentences for the three talkers

Keywords

Markov processes; speech recognition; airline flight reservation; connected digit recognition system; connected word recognizer; segmental k-means training algorithm; speaker dependent automatic speech recognition; string accuracy; training sequences generation; word hidden Markov models; word templates; Aerospace simulation; Automatic speech recognition; Hidden Markov models; Laboratories; Pattern recognition; Rain; Speech recognition; Testing; Text recognition; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on

Conference_Location

New York, NY

ISSN

1520-6149

Type

conf

DOI

10.1109/ICASSP.1988.196611

Filename

196611