Title :
Adaptation of multilingual stacked bottle-neck neural network structure for new language
Author :
Grezl, Frantisek ; Karafiat, Martin ; Vesely, Karel
Author_Institution :
Speech@FIT & IT4I Center of Excellence, Brno Univ. of Technol., Brno, Czech Republic
Abstract :
The neural network based features became an inseparable part of state-of-the-art LVCSR systems. In order to perform well, the network has to be trained on a large amount of in-domain data. With the increasing emphasis on fast development of ASR system on limited resources, there is an effort to alleviate the need of in-domain data. To evaluate the effectiveness of other resources, we have trained the Stacked Bottle-Neck neural networks structure on multilingual data investigating several training strategies while treating the target language as the unseen one. Further, the systems were adapted to the target language by re-training. Finally, we evaluated the effect of adaptation of individual NNs in the Stacked Bottle-Neck structure to find out the optimal adaptation strategy. We have shown that the adaptation can significantly improve system performance over both, the multilingual network and network trained only on target data. The experiments were performed on Babel Year 1 data.
Keywords :
feature extraction; natural language processing; neural nets; speech recognition; ASR system; Babel Year 1 data; LVCSR systems; adaptation strategy; automatic speech recognition systems; in-domain data; multilingual data; multilingual network; multilingual stacked bottle-neck neural network structure; Acoustics; Artificial neural networks; Feature extraction; Speech; Training; Training data; Bottle-neck features; Stacked BottleNeck structure; feature extraction; multilingual neural networks; neural network adaptation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6855089