DocumentCode
1090414
Title
A speaker-independent speech-recognition system based on linear prediction
Author
Gupta, Vishwa N. ; Bryan, J. Kent ; Gowdy, John N.
Author_Institution
Clemson University, Clemson, SC
Volume
26
Issue
1
fYear
1978
fDate
2/1/1978
Firstpage
27
Lastpage
33
Abstract
This paper describes a speaker-independent speech-recognition system using autoregression (linear prediction) on speech samples. Isolated words from a standard 40-word reading test vocabulary are spoken by 25 different speakers. A reference pattern for each word is stored as coefficients of the Yule-Walker equations for 50 consecutive overlapped time windows. Various distance measures are then proposed and evaluated in terms of recognition accuracy and speed of computation. The best measure gives a recognition rate of 90.3 percent. Both the nearest-neighbor and K-nearest-neighbor algorithms are used in the implemented decision scheme. The computation is minimized by making sequential decisions after a fixed number of iterations. Computationally, this distance measure coupled with a nonlinear time-warping function for matching windows is observed to give optimal results. The number of speakers was then increased to 105 to show the statistical significance of the results obtained in this project. The recognition rate obtained with the best procedure for 105 speakers was 89.2 percent. The recognition time for this procedure was 9.8 seconds per utterance.
Keywords
Acoustic measurements; Couplings; Equations; Pattern recognition; Shape measurement; Speech recognition; Testing; Time measurement; Velocity measurement; Vocabulary
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
IEEE
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1978.1163054
Filename
1163054
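
The abstract describes storing each word as linear-prediction coefficients obtained from the Yule-Walker equations over overlapped time windows, then classifying by nearest neighbor. The following is a minimal sketch of that general idea, not the authors' implementation. The function names, the Levinson-Durbin solver, the squared-Euclidean distance, and the frame-by-frame (linear) window matching are all assumptions chosen for illustration; the abstract does not specify the paper's distance measures or its nonlinear time-warping function, and neither is reproduced here.

```python
from typing import Dict, List, Sequence, Tuple

Pattern = List[List[float]]  # one coefficient vector per time window


def frame_signal(samples: Sequence[float], frame_len: int, hop: int) -> List[Sequence[float]]:
    """Split a signal into overlapped windows (overlap when hop < frame_len)."""
    return [samples[s:s + frame_len]
            for s in range(0, len(samples) - frame_len + 1, hop)]


def autocorrelation(frame: Sequence[float], order: int) -> List[float]:
    """Autocorrelation values r[0..order] of one window."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(order + 1)]


def levinson_durbin(r: Sequence[float], order: int) -> Tuple[List[float], float]:
    """Solve the Yule-Walker equations sum_k a[k] * r[|i-k|] = r[i].

    Returns the predictor coefficients a[1..order], for the model
    x[n] ~ sum_k a[k] * x[n-k], and the final prediction error.
    """
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # silent or degenerate window: stop early
            break
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err  # reflection coefficient
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err


def lpc_pattern(samples: Sequence[float], order: int, frame_len: int, hop: int) -> Pattern:
    """Coefficient vectors for every overlapped window of an utterance."""
    return [levinson_durbin(autocorrelation(f, order), order)[0]
            for f in frame_signal(samples, frame_len, hop)]


def pattern_distance(p: Pattern, q: Pattern) -> float:
    """Assumed distance: squared Euclidean, windows paired in order."""
    return sum((x - y) ** 2
               for fp, fq in zip(p, q)
               for x, y in zip(fp, fq))


def nearest_word(references: Dict[str, List[Pattern]], pattern: Pattern) -> str:
    """Nearest-neighbor decision over all stored reference patterns."""
    return min(references,
               key=lambda w: min(pattern_distance(ref, pattern)
                                 for ref in references[w]))
```

In this sketch an utterance would be converted with `lpc_pattern` and passed to `nearest_word` along with a dictionary mapping each vocabulary word to its stored reference patterns. Pairing windows in order only makes sense when every pattern has the same number of windows, as with the fixed 50 windows per word mentioned in the abstract.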