DocumentCode :
1110800
Title :
A weighted cepstral distance measure for speech recognition
Author :
Tohkura, Yoh´ichi
Author_Institution :
ATR Auditory & Visual Perception Research Laboratories, Osaka, Japan
Volume :
35
Issue :
10
fYear :
1987
fDate :
10/1/1987 12:00:00 AM
Firstpage :
1414
Lastpage :
1422
Abstract :
A weighted cepstral distance measure is proposed and is tested in a speaker-independent isolated word recognition system using standard DTW (dynamic time warping) techniques. The measure is a statistically weighted distance measure with weights equal to the inverse variance of the cepstral coefficients. The experimental results show that the weighted cepstral distance measure works substantially better than both the Euclidean cepstral distance and the log likelihood ratio distance measures across two different databases. The recognition error rate obtained using the weighted cepstral distance measure was about 1 percent for digit recognition. This result was less than one-fourth of that obtained using the simple Euclidean cepstral distance measure and about one-third of the results using the log likelihood ratio distance measure. The most significant performance characteristic of the weighted cepstral distance was that it tended to equalize the performance of the recognizer across different talkers.
Keywords :
Cepstral analysis; Character recognition; Databases; Distortion measurement; Euclidean distance; Linear predictive coding; Speech recognition; System testing; Time measurement; Weight measurement;
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1987.1165058
Filename :
1165058
Link To Document :
بازگشت