Title :
Using Phase Space based processing to extract proper features for ASR systems
Author :
Shekofteh, Yasser ; Almasganj, Farshad
Author_Institution :
Biomed. Eng. Fac., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
In this paper a feature extraction technique using Reconstructed Phase Spaces (RPS) is presented, which improves the overall performances of typical speech recognition systems. Unlike conventional feature extraction methods that use FFT based algorithm as power spectrum estimation (PSE) of speech signal, the proposed method is based on the trajectory and flow matrix of signal´s RPS. In this manner, a new representation of power spectrum is obtained using two dimensional DFT algorithm by which, we can gain modify versions of common feature extraction methods such as MFCC. We conducted some speech recognition experiments using HTK, the known HMM-based toolkit, over FARSDAT, a known Persian speech corpus. Through this modified version of feature extraction method, we gained 1.35% word error rate improvement in comparison to the baseline system which exploits the typical MFCC feature extraction method.
Keywords :
discrete Fourier transforms; feature extraction; speech recognition; FARSDAT Persian speech corpus; HTK toolkit; MFCC feature extraction; automatic speech recognition systems; discrete Fourier transforms; phase space based processing; power spectrum representation; reconstructed phase space technique; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech processing; Speech recognition; Trajectory; feature extraction; nonlinear dynamics; reconstructed phase space; spectral analysis; speech recognition;
Conference_Titel :
Telecommunications (IST), 2010 5th International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-4244-8183-5
DOI :
10.1109/ISTEL.2010.5734094