DocumentCode :
1088413
Title :
Distance measures for speech processing
Author :
Gray, Augustine H., Jr. ; Markel, John D.
Author_Institution :
University of California, Santa Barbara, CA
Volume :
24
Issue :
5
fYear :
1976
fDate :
10/1/1976 12:00:00 AM
Firstpage :
380
Lastpage :
391
Abstract :
The properties and interrelationships among four measures of distance in speech processing are theoretically and experimentally discussed. The root mean square (rms) log spectral distance, cepstral distance, likelihood ratio (minimum residual principle or delta coding (DELCO) algorithm), and a cosh measure (based upon two nonsymmetrical likelihood ratios) are considered. It is shown that the cepstral measure bounds the rms log spectral measure from below, while the cosh measure bounds it from above. A simple nonlinear transformation of the likelihood ratio is shown to be highly correlated with the rms log spectral measure over expected ranges. Relationships between distance measure values and perception are also considered. The likelihood ratio, cepstral measure, and cosh measure are easily evaluated recursively from linear prediction filter coefficients, and each has a meaningful and interrelated frequency domain interpretation. Fortran programs are presented for computing the recursively evaluated distance measures.
Keywords :
Autocorrelation; Cepstral analysis; Euclidean distance; Nonlinear filters; Oral communication; Root mean square; Speech analysis; Speech processing; Speech recognition; Testing;
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1976.1162849
Filename :
1162849
Link To Document :
بازگشت