• DocumentCode
    2993041
  • Title

    Speaker independent telephone speech recognition

  • Author

    Iizuka, Hideo

  • Author_Institution
    OKI Electric Ind. Co., Ltd., Tokyo, Japan
  • Volume
    10
  • fYear
    1985
  • fDate
    31138
  • Firstpage
    842
  • Lastpage
    845
  • Abstract
    This paper describes a recognition method, a reference pattern generation method, and an evaluation of speaker-independent recognition for telephone speech response systems. The input utterance is analyzed by a 19-channel bank of band-pass filters (BPFs). The power and vocal cord source characteristics are normalized. Time normalization is achieved by linearly compressing or expanding the utterance to 32 frames. The speech pattern is matched against male and female reference patterns, and the category of the nearest reference pattern is taken as the result. The reference patterns must be optimized so that speech is recognized correctly despite differences in formant frequencies and slight segmentation errors. To optimize them, recognition of the training patterns and updating of the reference patterns are repeated. A total of 256 male and female reference patterns were generated. The recognition accuracy of this method on non-training voice data was 95.8% with automatic segmentation.
  • Keywords
    Data analysis; Equations; Flowcharts; Frequency conversion; Least squares approximation; Pattern matching; Pattern recognition; Speech analysis; Speech recognition; Telephony;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '85.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1985.1168288
  • Filename
    1168288
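
The matching pipeline summarized in the abstract (19 filter-bank channels, linear time normalization to 32 frames, nearest-reference classification) can be sketched roughly as follows. This is an illustrative sketch only, not the paper's implementation: the function names, the mean-subtraction used for power normalization, the linear interpolation between frames, and the squared Euclidean distance are all assumptions, since the abstract does not specify them. Vocal cord source normalization and the iterative reference pattern update are left out.

```python
import math

NUM_CHANNELS = 19  # band-pass filter channels, per the abstract
NUM_FRAMES = 32    # fixed pattern length after linear time normalization


def normalize_power(frame):
    """Remove the overall level from one frame of channel energies.

    Assumption: frames hold log-energies, so subtracting the mean
    removes the power offset. The abstract does not give the formula.
    """
    mean = sum(frame) / len(frame)
    return [x - mean for x in frame]


def linear_time_normalize(frames, target=NUM_FRAMES):
    """Linearly compress or expand a frame sequence to `target` frames."""
    n = len(frames)
    if n == 0:
        raise ValueError("empty utterance")
    if n == 1:
        return [list(frames[0]) for _ in range(target)]
    out = []
    for t in range(target):
        # Map output index t onto the input time axis.
        pos = t * (n - 1) / (target - 1)
        lo = int(math.floor(pos))
        hi = min(lo + 1, n - 1)
        w = pos - lo
        # Interpolate between neighbouring input frames (assumed scheme).
        out.append([(1 - w) * a + w * b
                    for a, b in zip(frames[lo], frames[hi])])
    return out


def distance(p, q):
    """Squared Euclidean distance between two patterns (assumed metric)."""
    return sum((a - b) ** 2
               for fp, fq in zip(p, q)
               for a, b in zip(fp, fq))


def recognize(frames, references):
    """Return the category of the nearest reference pattern.

    `references` is a list of (category, pattern) pairs, where each
    pattern is NUM_FRAMES frames of NUM_CHANNELS values. Male and female
    references for the same word would simply share a category label.
    """
    pattern = linear_time_normalize([normalize_power(f) for f in frames])
    return min(references, key=lambda ref: distance(pattern, ref[1]))[0]
```

Keeping several references per category in one flat list mirrors the abstract's use of separate male and female patterns: the classifier only reports the category of whichever reference is closest.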