DocumentCode :
1094849
Title :
Performance tradeoffs in dynamic time warping algorithms for isolated word recognition
Author :
Myers, Cory ; Rabiner, Lawrence R. ; Rosenberg, Aaron E.
Author_Institution :
Massachusetts Institute of Technology, Cambridge, MA
Volume :
28
Issue :
6
fYear :
1980
fDate :
12/1/1980 12:00:00 AM
Firstpage :
623
Lastpage :
635
Abstract :
The technique of dynamic programming for the time registration of a reference and a test pattern has found widespread use in the area of isolated word recognition. Recently, a number of variations on the basic time warping algorithm have been proposed by Sakoe and Chiba, and Rabiner, Rosenberg, and Levinson. These algorithms all assume that the test input is the time pattern of a feature vector from an isolated word whose endpoints are known (at least approximately). The major differences in the methods are the global path constraints (i.e., the region of possible warping paths), the local continuity constraints on the path, and the distance weighting and normalization used to give the overall minimum distance. The purpose of this investigation is to study the effects of such variations on the performance of different dynamic time warping algorithms for a realistic speech database. The performance measures that were used include: speed of operation, memory requirements, and recognition accuracy. The results show that both axis orientation and relative length of the reference and the test patterns are important factors in recognition accuracy. Our results suggest a new approach to dynamic time warping for isolated words in which both the reference and test patterns are linearly warped to a fixed length, and then a simplified dynamic time warping algorithm is used to handle the nonlinear component of the time alignment. Results with this new algorithm show performance comparable to or better than that of all other dynamic time warping algorithms that were studied.
Keywords :
Acoustic testing; Databases; Dynamic programming; Heuristic algorithms; Isolation technology; Pattern recognition; Signal processing algorithms; Speech processing; Speech recognition; Velocity measurement;
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1980.1163491
Filename :
1163491
Link To Document :
بازگشت