• DocumentCode
    3087535
  • Title

    A novel feature extractor employing regularized MVDR spectrum estimator and subband spectrum enhancement technique

  • Author

    Alam, Mohammad Jahangir ; O´Shaughnessy, D. ; Kenny, P.

  • Author_Institution
    INRS-EMT, Univ. of Quebec, Montreal, QC, Canada
  • fYear
    2013
  • fDate
    12-15 May 2013
  • Firstpage
    342
  • Lastpage
    346
  • Abstract
    This paper presents a novel feature extractor for robust large vocabulary continuous speech recognition (LVCSR) task. For accurate and robust estimation of speech power spectrum we propose to compute the features from the regularized minimum variance distortionless response (regMVDR) spectral estimate instead of the windowed periodogram estimate. A sigmoid shape subband spectrum enhancement technique and a short-time feature normalization, known as short-time mean and scale normalization (STMSN), are also used for robust estimation of the cepstral features for speech recognition task. When evaluated on the AURORA-4 LVCSR corpus proposed feature extractor provides an average relative improvement of 38.5%,35.0%, and 34.3%,30.7%,5.6%, and 7.1% over the MFCC, PLP, MVDR-based MFCC, regMVDR-based MFCC, PNCC and the robust feature extractor of [4], respectively, in terms of the recognition accuracy.
  • Keywords
    feature extraction; speech recognition; AURORA-4 LVCSR; LVCSR; MFCC; MVDR-based MFCC; cepstral features; feature extractor; large vocabulary continuous speech recognition; recognition accuracy; regMVDR spectral estimate; regMVDR-based MFCC, PNCC and; regularized MVDR spectrum estimator; regularized minimum variance distortionless response; short-time feature normalization; short-time mean and scale normalization; sigmoid shape subband spectrum enhancement technique; speech recognition task; subband spectrum enhancement technique; windowed periodogram estimate; Estimation; Feature extraction; Mel frequency cepstral coefficient; Noise; Robustness; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Signal Processing and their Applications (WoSSPA), 2013 8th International Workshop on
  • Conference_Location
    Algiers
  • Type

    conf

  • DOI
    10.1109/WoSSPA.2013.6602388
  • Filename
    6602388