• DocumentCode
    1090414
  • Title

    A speaker-independent speech-recognition system based on linear prediction

  • Author

    Gupta, Vishwa N. ; Bryan, J. Kent ; Gowdy, John N.

  • Author_Institution
    Clemson University, Clemson, SC
  • Volume
    26
  • Issue
    1
  • fYear
    1978
  • fDate
    2/1/1978 12:00:00 AM
  • Firstpage
    27
  • Lastpage
    33
  • Abstract
    This paper describes a speaker-independent speech-recognition system using autoregression (linear prediction) on speech samples. Isolated words from a standard 40-word reading test vocabulary are spoken by 25 different speakers. A reference pattern for each word is stored as coefficients of the Yule-Walker equations for 50 consecutive overlapped time windows. Various distance measures are then proposed and evaluated in terms of accuracy of recognition and speed of computation. The best measure gives 90.3 percent rate of recognition. Both the nearest-neighbor and K-nearest-neighbor algorithms are used in the decision scheme implemented. The computation is minimized by making sequential decisions after a fixed number of iterations. It is observed that computationally this distance measure coupled with a nonlinear time-warped function for matching of windows gives optimal results. The number of speakers was then increased to 105 to show the statistical significance of the results obtained in this project. The recognition rate obtained with the best procedure for 105 speakers was 89.2 percent. The recognition time for this procedure was 9.8 seconds per utterance.
  • Keywords
    Acoustic measurements; Couplings; Equations; Pattern recognition; Shape measurement; Speech recognition; Testing; Time measurement; Velocity measurement; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Acoustics, Speech and Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0096-3518
  • Type

    jour

  • DOI
    10.1109/TASSP.1978.1163054
  • Filename
    1163054