• DocumentCode
    723329
  • Title

    Preliminary experiments on the robustness of biologically motivated features for DNN-based ASR

  • Author

    de-la-Calle-Silos, F. ; Valverde-Albacete, Francisco J. ; Gallardo-Antolin, A. ; Pelaez-Moreno, C.

  • Author_Institution
    Signal Theor. & Commun. Dept., Univ. Carlos III de Madrid, Leganés, Spain
  • fYear
    2015
  • fDate
    10-12 June 2015
  • Firstpage
    169
  • Lastpage
    176
  • Abstract
    A perceptually motivated feature extraction method based on mimicking the masking properties of the cochlea has been recently found to provide enhanced performance when applied to conventional speech recognition back-ends. On the other hand, the introduction of Deep Neural Network (DNN) based acoustic models has produced dramatic improvements in performance. In particular, we found that Deep Maxout Networks, a modification of DNNs´ feed-forward architecture that uses a max-out activation function, provides enhanced robustness to environmental noise. In this paper, we present preliminary experiments on the combination of these two elements that already show how the DMN-based back-end is capable of taking advantage of these auditorily inspired features making the whole system more robust and also suggesting that human-like representations of speech keep playing an important role in DNN-based automatic speech recognition systems.
  • Keywords
    ear; feature extraction; feedforward neural nets; hearing; speech intelligibility; speech recognition; DNN-based ASR; DNN-based automatic speech recognition system; auditorily inspired feature extraction method; cochlea masking property; deep maxout network; deep neural network based acoustic model; feature extraction method; feedforward architecture; max-out activation function; Biology; Neural networks; Noise; Robustness; Speech; Time-frequency analysis; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinspired Intelligence (IWOBI), 2015 4th International Work Conference on
  • Conference_Location
    San Sebastian
  • Print_ISBN
    978-1-4673-7845-1
  • Type

    conf

  • DOI
    10.1109/IWOBI.2015.7160162
  • Filename
    7160162