Title :
Investigation on cross- and multilingual MLP features under matched and mismatched acoustical conditions
Author :
Tuske, Zoltan ; Pinto, Joel ; Willett, Daniel ; Schluter, Ralf
Author_Institution :
Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
Abstract :
In this paper, Multi Layer Perceptron (MLP) based multilingual bottleneck features are investigated for acoustic modeling in three languages - German, French, and US English. We use a modified training algorithm to handle the multilingual training scenario without having to explicitly map the phonemes to a common phoneme set. Furthermore, the cross-lingual portability of bottleneck features between the three languages are also investigated. Single pass recognition experiments on large vocabulary SMS dictation task indicate that (1) multilingual bottleneck features yield significantly lower word error rates compared to standard MFCC features (2) multilingual bottleneck features are superior to monolingual bottleneck features trained for the target language with limited training data, and (3) multilingual bottleneck features are beneficial in training acoustic models in a low resource language where only mismatched training data is available-by exploiting the more matched training data from other languages.
Keywords :
acoustic signal processing; error statistics; linguistics; multilayer perceptrons; natural language processing; speech recognition; vocabulary; French language; German language; MFCC features; MLP based multilingual bottleneck features; SMS dictation task; US English language; acoustic modeling; cross-lingual MLP features; cross-lingual portability; mismatched acoustical conditions; mismatched training data; modified training algorithm; monolingual bottleneck features; multilayer perceptron; multilingual MLP features; multilingual training scenario; phoneme set; single pass recognition experiments; vocabulary; word error rates; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Training; Training data; MLP; bottleneck; mismatched acoustical condition; multilingual;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639090