Title :
Two novel FDLP based feature extraction methods for improvement of speech recognition
Author :
Shekofteh, Yasser ; Almasganj, Farshad ; Rezaei, Ahmadreza ; Goodarzi, Mohammad Mohsen
Author_Institution :
Biomed. Eng. Fac., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
In conventional automatic speech recognition systems, linguistic information of the speech signal are usually acquired from short-time frames about 10-30 ms. In this paper we have proposed two novel methods extracting the long-term information of the speech signal. Both of the methods are based on “sub-band FDLP” which divides the long-time frame of signal into several sub-bands. Using the MFCC algorithm, we are able to represent the long-term temporal features of the each sub-band. Our results show that the proposed methods could improve the recognition ratio by %1.73. The proposed methods were evaluated using the FarsDat database and the method´s robustness against different conditions of noise was experimented.
Keywords :
feature extraction; speech recognition; FarsDat database; MFCC algorithm; feature extraction methods; linguistic information; speech recognition; subband FDLP; Discrete cosine transforms; Feature extraction; Mel frequency cepstral coefficient; Prediction algorithms; Signal to noise ratio; Speech; Speech recognition; Feature extraction; Linear predictive coding; Speech recognition; Time-frequency analysis;
Conference_Titel :
Telecommunications (IST), 2010 5th International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-4244-8183-5
DOI :
10.1109/ISTEL.2010.5734095