Title :
An ideal hidden-activation mask for deep neural networks based noise-robust speech recognition
Author :
Bo Li ; Khe Chai Sim
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
Deep neural networks (DNNs) are capable of modeling large acoustic variations. However, the performance on noisy data is still below humans´ expectations. In this work, we present an ideal hidden-activation masking (IHM) approach to improve their noise robustness. This IHM is inspired by the existing spectral masking techniques. Instead of masking away the noise-dominant components in the spectral domain, we propose to discard DNNs´ inconsistent hidden activations. The IHM is computed from the parallel data to identify hidden units that are immune to environment noise. DNNs then utilize it to improve their prediction robustness with the noise-invariant activations. Experimental results on the Aurora4 task have shown that the proposed IHM is both effective in reducing noise variations and robust to mask estimation errors.
Keywords :
neural nets; speech intelligibility; speech recognition; Aurora4 task; DNN; IHM; deep neural networks; ideal hidden-activation masking approach; noise-dominant components; noise-invariant activations; noise-robust speech recognition; spectral masking techniques; Neural networks; Noise; Noise measurement; Noise robustness; Robustness; Speech; Speech recognition; Deep Neural Networks; Noise Robustness;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853586