• DocumentCode
    786817
  • Title

    A speaker-independent connected digit recognition system concatenating statistically discriminated words

  • Author

    Ukita, Teruhiko ; Saito, Etsuo ; Nitta, Tsuneo ; Watanabe, Sadakazu

  • Author_Institution
    Toshiba Kansai Res. Lab., Kobe, Japan
  • Volume
    40
  • Issue
    10
  • fYear
    1992
  • fDate
    10/1/1992 12:00:00 AM
  • Firstpage
    2414
  • Lastpage
    2424
  • Abstract
    A recognition system for connected digits, which uses a statistical classifier to identify words in speaker-independent continuous speech, is described. The system uses the multiple similarity method, a statistical pattern recognition technique. For evaluating word strings, the system uses a scoring method that is independent of the number of words in the strings. It is derived from the a posteriori probability that a subinterval corresponds to a correct word position, giving a word similarity value. The system evaluates a word string using dynamic programming and a parallel search procedure. Experiments for the contextual effect of the training data set, for validation of the search algorithm, and for a large quantity of unspecified speakers including 40 males and 40 females were performed. For connected digits (unknown word lengths test), the string recognition rates were 90.1%-95.1% for two, three, or four connected digits, where the equivalent word (digit) rates were 97.4%-98.4%
  • Keywords
    dynamic programming; speech recognition; statistical analysis; a posteriori probability; continuous speech; dynamic programming; multiple similarity method; parallel search procedure; scoring method; speaker-independent connected digit recognition; statistical classifier; statistical pattern recognition; statistically discriminated words concentration; unknown word lengths test; word position; word similarity value; word strings; Algorithm design and analysis; Communications technology; Computational efficiency; Dynamic programming; Man machine systems; Performance evaluation; Speech analysis; Speech recognition; Testing; Training data;
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1053-587X
  • Type

    jour

  • DOI
    10.1109/78.157286
  • Filename
    157286