• DocumentCode
    730731
  • Title

    Robust excitation-based features for Automatic Speech Recognition

  • Author

    Drugman, Thomas ; Stylianou, Yannis ; Langzhou Chen ; Xie Chen ; Gales, Mark J. F.

  • Author_Institution
    Cambridge Res. Lab., Toshiba Res. Eur. Ltd., Cambridge, UK
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4664
  • Lastpage
    4668
  • Abstract
    In this paper we investigate the use of noise-robust features characterizing the speech excitation signal as complementary features to the usually considered vocal tract based features for Automatic Speech Recognition (ASR). The proposed Excitation-based Features (EBF) are tested in a state-of-the-art Deep Neural Network (DNN) based hybrid acoustic model for speech recognition. The suggested excitation features expand the set of periodicity features previously considered for ASR, expecting that these features help in a better discrimination of the broad phonetic classes (e.g., fricatives, nasal, vowels, etc.). Our experiments on the AMI meeting transcription system showed that the proposed EBF yield a relative word error rate reduction of about 5% when combined with conventional PLP features. Further experiments led on Aurora4 confirmed the robustness of the EBF to both additive and convolutive noises, with a relative improvement of 4.3% obtained by combinining them with mel filter banks.
  • Keywords
    feature extraction; filtering theory; neural nets; speech recognition; AMI meeting transcription system; ASR; DNN; EBF; automatic speech recognition; broad phonetic classes; error rate reduction; excitation based features; excitation features; hybrid acoustic model; mel filter banks; noise robust features; robust excitation; speech excitation signal; state-of-the-art deep neural network; vocal tract; Acoustics; Feature extraction; Hidden Markov models; Robustness; Speech; Speech recognition; Training; automatic speech recognition; neural networks; speech excitation signal;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178855
  • Filename
    7178855