A speaker-independent connected digit recognition system concatenating statistically discriminated words

Author

Ukita, Teruhiko ; Saito, Etsuo ; Nitta, Tsuneo ; Watanabe, Sadakazu

Author_Institution

Toshiba Kansai Res. Lab., Kobe, Japan

Volume

40

Issue

10

fYear

1992

fDate

10/1/1992

Firstpage

2414

Lastpage

2424

Abstract

A recognition system for connected digits, which uses a statistical classifier to identify words in speaker-independent continuous speech, is described. The classifier is based on the multiple similarity method, a statistical pattern recognition technique. To evaluate word strings, the system uses a scoring method that is independent of the number of words in the string. The score is derived from the a posteriori probability that a subinterval corresponds to a correct word position, given a word similarity value. The system evaluates a word string using dynamic programming and a parallel search procedure. Experiments were performed on the contextual effect of the training data set, on validation of the search algorithm, and on a large set of unspecified speakers, including 40 males and 40 females. For connected digits (unknown word lengths test), the string recognition rates were 90.1%-95.1% for two, three, or four connected digits, where the equivalent word (digit) rates were 97.4%-98.4%.
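
The dynamic-programming evaluation of word strings mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the function name `best_digit_string`, the `seg_score` callback, the `min_len`/`max_len` limits, and the duration-weighted averaging used to make the string score independent of the number of words are all assumptions made for illustration. The abstract does not give the actual scoring formula or the parallel search procedure.

```python
from typing import Callable, List, Optional, Tuple

Segment = Tuple[int, int, str]  # (start_frame, end_frame, word)


def best_digit_string(
    num_frames: int,
    words: List[str],
    seg_score: Callable[[int, int, str], float],
    min_len: int = 1,
    max_len: int = 50,
) -> Tuple[float, List[Segment]]:
    """Find the word string that best covers frames [0, num_frames).

    seg_score(start, end, word) is a hypothetical per-frame score for the
    hypothesis that `word` occupies frames [start, end). Each segment is
    weighted by its duration and the total is divided by num_frames, so the
    final score does not grow or shrink with the number of words (an
    assumed normalisation, not the paper's formula).
    """
    neg_inf = float("-inf")
    best = [neg_inf] * (num_frames + 1)          # best[t]: best sum up to frame t
    back: List[Optional[Tuple[int, str]]] = [None] * (num_frames + 1)
    best[0] = 0.0

    for end in range(1, num_frames + 1):
        first_start = max(0, end - max_len)
        for start in range(first_start, end - min_len + 1):
            if best[start] == neg_inf:
                continue                          # no valid string ends at `start`
            duration = end - start
            for word in words:
                cand = best[start] + duration * seg_score(start, end, word)
                if cand > best[end]:
                    best[end] = cand
                    back[end] = (start, word)

    if best[num_frames] == neg_inf:
        return neg_inf, []                        # no segmentation fits the limits

    # Trace the back-pointers from the last frame to recover the string.
    segments: List[Segment] = []
    t = num_frames
    while t > 0:
        start, word = back[t]
        segments.append((start, t, word))
        t = start
    segments.reverse()
    return best[num_frames] / num_frames, segments
```

In use, `seg_score` would wrap whatever classifier output is available for a candidate subinterval; the sketch only shows how such subinterval scores could be combined by dynamic programming when the number of digits is not known in advance.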

Keywords

dynamic programming; speech recognition; statistical analysis; a posteriori probability; continuous speech; multiple similarity method; parallel search procedure; scoring method; speaker-independent connected digit recognition; statistical classifier; statistical pattern recognition; statistically discriminated words concatenation; unknown word lengths test; word position; word similarity value; word strings; Algorithm design and analysis; Communications technology; Computational efficiency; Dynamic programming; Man machine systems; Performance evaluation; Speech analysis; Speech recognition; Testing; Training data

fLanguage

English

Journal_Title

Signal Processing, IEEE Transactions on

Publisher

IEEE

ISSN

1053-587X

Type

jour

DOI

10.1109/78.157286

Filename

157286