Title :
A novel approach for speech feature extraction by Cubic-Log compression in MFCC
Author :
Devi, M. Renuga ; Ravichandran, T.
Author_Institution :
Dept. of Eng., Karpagam Univ., Coimbatore, India
Abstract :
Speech Pre-processing is measured as major step in development of feature vector extraction for an efficient Automatic Speech Recognition (ASR) system. A novel approach for speech feature extraction is by applying the Mel-frequency cepstral co-efficient (MFCC) algorithm using Cubic-Log compression instead of Logarithmic compression in MFCC. In proposed MFCC, the frequency axis is initially warped to the mel-scale which is roughly below 2 kHz and logarithmic above this point. Triangular filter are equally spaced in the mel-scale are applied on the warped spectrum. The result of the filters are compressed using Cubic-Log function and cepstral co-efficient are computed by applying DCT to obtain minimum MFCC feature vector for spoken words. These feature vectors are given as input to classification and Recognition phase. The system is trained and tested by generating MFCC feature vector for 600 isolated words, 256 connected words and 150 sentences in clear and noisy environment. Experiment results shows that with minimum MFCC feature vector is enough for speech recognition system to achieve high recognition rate and its performance is measured based on Mean Square Error (MSE) rate.
Keywords :
discrete cosine transforms; feature extraction; filtering theory; mean square error methods; signal classification; speech recognition; ASR system; DCT; MFCC algorithm; MSE rate; Mel-frequency cepstral coefficient; automatic speech recognition system; classification phase; connected word; cubic-log compression; cubic-log function; discrete cosine transform; feature vector extraction; frequency 2 kHz; isolated word; logarithmic compression; mean square error rate; recognition phase; sentence; speech feature extraction; speech preprocessing; warped spectrum; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; Vectors; Cubic-Log Compression; Hidden Markov Model; Mean Square Error (MSE); Mel-frequency cepstral co-efficient (MFCC);
Conference_Titel :
Pattern Recognition, Informatics and Mobile Engineering (PRIME), 2013 International Conference on
Conference_Location :
Salem
Print_ISBN :
978-1-4673-5843-9
DOI :
10.1109/ICPRIME.2013.6496469