DocumentCode :
701488
Title :
Recognition of phonemes from estimation errors
Author :
Baghai-Ravary, L ; Beet, S W
Author_Institution :
Department of Electronic and Electrical Engineering, The University of Sheffield, Mappin Street, Sheffield, S1 3JD, UK
fYear :
1996
fDate :
10-13 Sept. 1996
Firstpage :
1
Lastpage :
4
Abstract :
Speech recognition systems generally use delta and delta-delta (velocity and acceleration) coefficients to characterise the dynamics apparent in frame-based representations of speech. These coefficients can be thought of as the errors of simple predictors. This paper describes the use of error coefficients derived from more advanced (and accurate) forms of prediction and interpolation. Both overall recognition accuracy and the detailed confusions observed are compared with those of the ‘traditional’ methods. The task used is speaker-independent phoneme recognition using a subset of the TIMIT database, and four different speech representations. The error coefficient performance on this task appears to be directly related to the robustness of the estimator used, with the best of the new methods out-performing delta-delta coefficients by around 10%.
Keywords :
Accuracy; Databases; Hidden Markov models; Interpolation; Speech; Speech recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
European Signal Processing Conference, 1996. EUSIPCO 1996. 8th
Conference_Location :
Trieste, Italy
Print_ISBN :
978-888-6179-83-6
Type :
conf
Filename :
7083214
Link To Document :
بازگشت