• DocumentCode
    2228098
  • Title

    Time-varying discriminative feature extraction effective for phonetic discrimination

  • Author

    Chengalvarayon, R.

  • Author_Institution
    Lucent Technol., AT&T Bell Labs., Naperville, IL
  • fYear
    1997
  • fDate
    9-12 Sep 1997
  • Firstpage
    767
  • Abstract
    We investigate the interactions of front-end preprocessing and back-end classification techniques in hidden Markov model (HMM) based speech recognition. The proposed model aims at finding an optimal linear transformation on the mel-warped discrete Fourier transform (DFT) with the construction of dynamic feature parameters according to the minimum classification error (MCE) criterion. Experimental results show that state-dependent transformation on mel-warped DFT features is superior in performance to the mel-frequency cepstral coefficients (MFCCs). An error rate reduction of 15% is obtained on a standard 39-class TIMIT phone classification task in comparison with the conventional MCE-trained HMM using MFCCs and delta MFCCs that have not been subject to optimization during training
  • Keywords
    discrete Fourier transforms; feature extraction; hidden Markov models; parameter estimation; pattern classification; speech recognition; time-varying filters; 39-class TIMIT phone classification; HMM; MFCC; back-end classification; dynamic feature parameters; error rate reduction; front-end preprocessing; hidden Markov model; mel-frequency cepstral coefficients; mel-warped DFT features; mel-warped discrete Fourier transform; minimum classification error criterion; optimal linear transformation; performance; phonetic discrimination; speech recognition; state-dependent transformation; time-varying discriminative feature extraction; Data mining; Discrete Fourier transforms; Discrete cosine transforms; Error analysis; Feature extraction; Hidden Markov models; Nonlinear filters; Speech analysis; Speech processing; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information, Communications and Signal Processing, 1997. ICICS., Proceedings of 1997 International Conference on
  • Print_ISBN
    0-7803-3676-3
  • Type

    conf

  • DOI
    10.1109/ICICS.1997.652082
  • Filename
    652082