A speaker-independent speech-recognition system based on linear prediction

Author

Gupta, Vishwa N. ; Bryan, J. Kent ; Gowdy, John N.

Author_Institution

Clemson University, Clemson, SC

Volume

26

Issue

1

fYear

1978

fDate

2/1/1978 12:00:00 AM

Firstpage

27

Lastpage

33

Abstract

This paper describes a speaker-independent speech-recognition system using autoregression (linear prediction) on speech samples. Isolated words from a standard 40-word reading test vocabulary are spoken by 25 different speakers. A reference pattern for each word is stored as coefficients of the Yule-Walker equations for 50 consecutive overlapped time windows. Various distance measures are then proposed and evaluated in terms of accuracy of recognition and speed of computation. The best measure gives 90.3 percent rate of recognition. Both the nearest-neighbor and K-nearest-neighbor algorithms are used in the decision scheme implemented. The computation is minimized by making sequential decisions after a fixed number of iterations. It is observed that computationally this distance measure coupled with a nonlinear time-warped function for matching of windows gives optimal results. The number of speakers was then increased to 105 to show the statistical significance of the results obtained in this project. The recognition rate obtained with the best procedure for 105 speakers was 89.2 percent. The recognition time for this procedure was 9.8 seconds per utterance.

Keywords

Acoustic measurements; Couplings; Equations; Pattern recognition; Shape measurement; Speech recognition; Testing; Time measurement; Velocity measurement; Vocabulary;

fLanguage

English

Journal_Title

Acoustics, Speech and Signal Processing, IEEE Transactions on

Publisher

ieee

ISSN

0096-3518

Type

jour

DOI

10.1109/TASSP.1978.1163054

Filename

1163054