DocumentCode :
547721
Title :
Comparison of linear based feature transformations to improve speech recognition performance
Author :
Shekofteh, Yasser ; Almasganj, Farshad ; Goodarzi, Mohammad Mohsen
Author_Institution :
Biomedical Engineering Faculty, Amirkabir University of Technology, Iran
fYear :
2011
fDate :
17-19 May 2011
Firstpage :
1
Lastpage :
4
Abstract :
In automatic speech recognition system a diagonal GMM based CDHMM modeling is commonly used. So there is a need to use reasonable feature transformation to decorrelate input feature vectors to satisfy diagonal GMM assumption. In this paper, we introduce the utilization of the several supervised linear feature transformation in speech recognition tasks. Specially each of these methods has particular projection properties. We show that the proposed OLPP based feature transformation method with preserving local properties of feature vectors in the projected space has the best performance based on our experiment on Persian speech database FARSDAT. Also we has introduced a novel class labeling method to use the supervised feature transformation. Overall system, compared to the baseline features, achieved an error rate reduction of 22.2% on clean condition.
Keywords :
Hidden Markov models; Labeling; Mel frequency cepstral coefficient; Speech; Speech recognition; Vectors; HLDA; LDA; LPP; OLPP; feature extraction; linear transformation; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical Engineering (ICEE), 2011 19th Iranian Conference on
Conference_Location :
Tehran, Iran
Print_ISBN :
978-1-4577-0730-8
Electronic_ISBN :
978-964-463-428-4
Type :
conf
Filename :
5955610
Link To Document :
بازگشت