• DocumentCode
    177463
  • Title

    An ideal hidden-activation mask for deep neural networks based noise-robust speech recognition

  • Author

    Bo Li ; Khe Chai Sim

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    200
  • Lastpage
    204
  • Abstract
    Deep neural networks (DNNs) are capable of modeling large acoustic variations. However, the performance on noisy data is still below humans´ expectations. In this work, we present an ideal hidden-activation masking (IHM) approach to improve their noise robustness. This IHM is inspired by the existing spectral masking techniques. Instead of masking away the noise-dominant components in the spectral domain, we propose to discard DNNs´ inconsistent hidden activations. The IHM is computed from the parallel data to identify hidden units that are immune to environment noise. DNNs then utilize it to improve their prediction robustness with the noise-invariant activations. Experimental results on the Aurora4 task have shown that the proposed IHM is both effective in reducing noise variations and robust to mask estimation errors.
  • Keywords
    neural nets; speech intelligibility; speech recognition; Aurora4 task; DNN; IHM; deep neural networks; ideal hidden-activation masking approach; noise-dominant components; noise-invariant activations; noise-robust speech recognition; spectral masking techniques; Neural networks; Noise; Noise measurement; Noise robustness; Robustness; Speech; Speech recognition; Deep Neural Networks; Noise Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853586
  • Filename
    6853586