Title :
Cross-language transfer learning for deep neural network based speech enhancement
Author :
Yong Xu ; Jun Du ; Li-Rong Dai ; Chin-Hui Lee
Author_Institution :
Nat. Eng. Lab. for Speech & Language Inf. Process., Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
In this paper, we propose a transfer learning approach to adapt a well-trained model obtained with high-resource materials of one language to another target language using a small amount of adaptation data for speech enhancement based on deep neural networks (DNNs). We investigate the performance degradation issues of enhancing noisy Mandarin speech data using DNN models already trained with only English speech materials, and vice versa. By assuming that the hidden layers of the well-trained DNN regression model as a cascade of feature extractors, we hypothesize that the first several layers should be transferable between languages. Our experimental results indicate that even with only about 1 minute of adaptation data from the resource-limited language we can achieve a considerable performance improvement over the DNN model without cross-language transfer learning.
Keywords :
learning (artificial intelligence); neural nets; regression analysis; speech enhancement; DNN models; English speech materials; cross-language transfer learning; deep neural network based speech enhancement; feature extractors; high-resource materials; noisy Mandarin speech data; resource-limited language; well-trained DNN regression model; well-trained model; Neural networks; Noise; Noise measurement; Speech; Speech enhancement; Training; deep neural network; multi-lingual; resource-limited language; speech enhancement; transfer learning;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
DOI :
10.1109/ISCSLP.2014.6936608