• DocumentCode
    1924392
  • Title

    A novel approach for speech feature extraction by Cubic-Log compression in MFCC

  • Author

    Devi, M. Renuga ; Ravichandran, T.

  • Author_Institution
    Dept. of Eng., Karpagam Univ., Coimbatore, India
  • fYear
    2013
  • fDate
    21-22 Feb. 2013
  • Firstpage
    182
  • Lastpage
    186
  • Abstract
    Speech Pre-processing is measured as major step in development of feature vector extraction for an efficient Automatic Speech Recognition (ASR) system. A novel approach for speech feature extraction is by applying the Mel-frequency cepstral co-efficient (MFCC) algorithm using Cubic-Log compression instead of Logarithmic compression in MFCC. In proposed MFCC, the frequency axis is initially warped to the mel-scale which is roughly below 2 kHz and logarithmic above this point. Triangular filter are equally spaced in the mel-scale are applied on the warped spectrum. The result of the filters are compressed using Cubic-Log function and cepstral co-efficient are computed by applying DCT to obtain minimum MFCC feature vector for spoken words. These feature vectors are given as input to classification and Recognition phase. The system is trained and tested by generating MFCC feature vector for 600 isolated words, 256 connected words and 150 sentences in clear and noisy environment. Experiment results shows that with minimum MFCC feature vector is enough for speech recognition system to achieve high recognition rate and its performance is measured based on Mean Square Error (MSE) rate.
  • Keywords
    discrete cosine transforms; feature extraction; filtering theory; mean square error methods; signal classification; speech recognition; ASR system; DCT; MFCC algorithm; MSE rate; Mel-frequency cepstral coefficient; automatic speech recognition system; classification phase; connected word; cubic-log compression; cubic-log function; discrete cosine transform; feature vector extraction; frequency 2 kHz; isolated word; logarithmic compression; mean square error rate; recognition phase; sentence; speech feature extraction; speech preprocessing; warped spectrum; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; Vectors; Cubic-Log Compression; Hidden Markov Model; Mean Square Error (MSE); Mel-frequency cepstral co-efficient (MFCC);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, Informatics and Mobile Engineering (PRIME), 2013 International Conference on
  • Conference_Location
    Salem
  • Print_ISBN
    978-1-4673-5843-9
  • Type

    conf

  • DOI
    10.1109/ICPRIME.2013.6496469
  • Filename
    6496469