• DocumentCode
    3585080
  • Title

    A sparsity based preprocessing for noise robust speech recognition

  • Author

    Koniaris, Christos ; Chatterjee, Saikat

  • Author_Institution
    Dept. of Philos., Univ. of Gothenburg, Gothenburg, Sweden
  • fYear
    2014
  • Firstpage
    513
  • Lastpage
    518
  • Abstract
    We show a method to sparsify the speech input that improves the robustness of an automatic speech recognizer. The proposed scheme is added to the system as a preprocessing module prior to the acoustic feature extraction. The preprocessing module passes the input speech signal through a linear predictive (LP) analysis filter and enforces sparsity in the LP residue domain. The sparsified prediction residue finally is filtered to generate the speech signal for computing a sequence of conventional feature vectors used in automatic speech recognition (ASR). Using standard feature vectors, our experiments show that sparsification in LP residue domain improves robustness in ASR performance.
  • Keywords
    feature extraction; filtering theory; signal denoising; speech recognition; ASR; LP analysis filter; LP residue domain; acoustic feature extraction; automatic speech recognition; feature vectors; input speech signal; linear predictive analysis filter; noise robust speech recognition; robustness improvement; sparsified prediction residue filtering; sparsity based preprocessing module; speech input sparsification; speech signal generation; standard feature vectors; Abstracts; Accuracy; Mel frequency cepstral coefficient; Psychoacoustics; Signal to noise ratio; Speech; Speech recognition; feature extraction; linear predictive analysis; residue signal; robust speech recognition; sparsity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2014 IEEE
  • Type

    conf

  • DOI
    10.1109/SLT.2014.7078627
  • Filename
    7078627