DocumentCode :
3162834
Title :
Multilingual MLP features for low-resource LVCSR systems
Author :
Thomas, Samuel ; Ganapathy, Sriram ; Hermansky, Hynek
Author_Institution :
Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4269
Lastpage :
4272
Abstract :
We introduce a new approach to training multilayer perceptrons (MLPs) for large vocabulary continuous speech recognition (LVCSR) in new languages which have only few hours of annotated in-domain training data (for example, 1 hour of data). In our approach, large amounts of annotated out-of-domain data from multiple languages are used to train multilingual MLP systems without dealing with the different phoneme sets for these languages. Features extracted from these MLP systems are used to train LVCSR systems in the low-resource language similar to the Tandem approach. In our experiments, the proposed features provide a relative improvement of about 30% in an low-resource LVCSR setting with only one hour of training data.
Keywords :
feature extraction; multilayer perceptrons; speech recognition; annotated in-domain training data; annotated out-of-domain data; feature extraction; large vocabulary continuous speech recognition; low-resource LVCSR systems; low-resource language; multilayer perceptrons; multilingual MLP features; multilingual MLP systems; multiple languages; phoneme sets; tandem approach; Abstracts; Hidden Markov models; Nonhomogeneous media; Speech; Switches; Vocabulary; MLP features for low-resource LVCSR; Multilingual training; multilayer perceptrons;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288862
Filename :
6288862
Link To Document :
بازگشت