• DocumentCode
    669808
  • Title

    Noise robust continuous digit recognition with reservoir-based acoustic models

  • Author

    Jalalvand, Azarakhsh ; Demuynck, Kris ; Martens, Jean-Pierre

  • Author_Institution
    Multimedia Lab., iMinds, Ghent Univ., Ghent, Belgium
  • fYear
    2013
  • fDate
    12-15 Nov. 2013
  • Firstpage
    204
  • Lastpage
    209
  • Abstract
    Notwithstanding the many years of research, more work is needed to create automatic speech recognition (ASR) systems with a close-to-human robustness against confounding factors such as ambient noise, channel distortion, etc. Whilst most work thus far focused on the improvement of ASR systems embedding Gaussian Mixture Models (GMM)s to compute the acoustic likelihoods in the states of a Hidden Markov Model (HMM), the present work focuses on the noise robustness of systems employing Reservoir Computing (RC) as an alternative acoustic modeling technique. Previous work already demonstrated good noise robustness for continuous digit recognition (CDR). The present paper investigates whether further progress can be achieved by driving reservoirs with noise-robust inputs that have been shown to raise the robustness of GMM-based systems, by introducing bi-directional reservoirs and by combining reservoirs with GMMs in a single system. Experiments on Aurora-2 demonstrate that it is indeed possible to raise the noise robustness without significantly increasing the system complexity.
  • Keywords
    Gaussian processes; hidden Markov models; mixture models; recurrent neural nets; speech recognition; Aurora-2; acoustic likelihoods; alternative acoustic modeling technique; automatic speech recognition system; bidirectional reservoir; close-to-human robustness; hidden Markov model; noise robust continuous digit recognition; noise robustness; reservoir based acoustic models; reservoir computing; systems embedding Gaussian mixture models; Acoustics; Feature extraction; Hidden Markov models; Neurons; Reservoirs; Speech; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Signal Processing and Communications Systems (ISPACS), 2013 International Symposium on
  • Conference_Location
    Naha
  • Print_ISBN
    978-1-4673-6360-0
  • Type

    conf

  • DOI
    10.1109/ISPACS.2013.6704547
  • Filename
    6704547