DocumentCode :
1796915
Title :
Noisy training for deep neural networks
Author :
Xiangtao Meng ; Chao Liu ; Zhiyong Zhang ; Dong Wang
Author_Institution :
Center for Speech & Language Technol., Tsinghua Univ., Beijing, China
fYear :
2014
fDate :
9-13 July 2014
Firstpage :
16
Lastpage :
20
Abstract :
Deep neural networks (DNN) have gained remarkable success in speech recognition, partially attributed to its flexibility in learning complex patterns of speech signals. This flexibility, however, may lead to serious over-fitting and hence miserable performance degradation in adverse environments such as those with high ambient noises. We propose a noisy training approach to tackle this problem: by injecting noises into the training speech intentionally and randomly, more generalizable DNN models can be learned. This `noise injection´ technique has been well-known to the neural computation community, however there is little knowledge if it would work for the DNN model which involves a highly complex objective function. The experiments presented in this paper confirm that the original assumptions of the noise injection approach largely holds when learning deep structures, and the noisy training may provide substantial performance improvement for DNN-based speech recognition.
Keywords :
learning (artificial intelligence); neural nets; speech recognition; DNN-based speech recognition; complex pattern learning; deep neural network; deep structure learning; neural computation; noise injection approach; noisy training approach; speech signal processing; Neural networks; Noise measurement; Signal to noise ratio; Speech; Speech recognition; Training; deep neural network; noise injection; robust speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4799-5401-8
Type :
conf
DOI :
10.1109/ChinaSIP.2014.6889193
Filename :
6889193
Link To Document :
بازگشت