DocumentCode
1689771
Title
Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers
Author
Jui-Ting Huang ; Jinyu Li ; Dong Yu ; Li Deng ; Yifan Gong
Author_Institution
Online Services Div., Microsoft Corp., Redmond, WA, USA
fYear
2013
Firstpage
7304
Lastpage
7308
Abstract
In the deep neural network (DNN), the hidden layers can be considered as increasingly complex feature transformations and the final softmax layer as a log-linear classifier making use of the most abstract features computed in the hidden layers. While the loglinear classifier should be different for different languages, the feature transformations can be shared across languages. In this paper we propose a shared-hidden-layer multilingual DNN (SHL-MDNN), in which the hidden layers are made common across many languages while the softmax layers are made language dependent. We demonstrate that the SHL-MDNN can reduce errors by 3-5%, relatively, for all the languages decodable with the SHL-MDNN, over the monolingual DNNs trained using only the language specific data. Further, we show that the learned hidden layers sharing across languages can be transferred to improve recognition accuracy of new languages, with relative error reductions ranging from 6% to 28% against DNNs trained without exploiting the transferred hidden layers. It is particularly interesting that the error reduction can be achieved for the target language that is in different families of the languages used to learn the hidden layers.
Keywords
hidden Markov models; linguistics; neural nets; speech recognition; DNN; complex feature transformations; cross-language knowledge transfer; error reductions; language specific data; learned hidden layers; log-linear classifier; multilingual deep neural network; recognition accuracy; Acoustics; Hidden Markov models; Neural networks; Speech; Speech recognition; Training; Training data; CD-DNN-HMM; deep neural network; multilingual speech recognition; multitask learning; transfer learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639081
Filename
6639081
Link To Document