مرکز منطقه ای اطلاع رساني علوم و فناوري - Multilingual MLP features for low-resource LVCSR systems

DocumentCode :

3162834

Title :

Multilingual MLP features for low-resource LVCSR systems

Author :

Thomas, Samuel ; Ganapathy, Sriram ; Hermansky, Hynek

Author_Institution :

Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4269

Lastpage :

4272

Abstract :

We introduce a new approach to training multilayer perceptrons (MLPs) for large vocabulary continuous speech recognition (LVCSR) in new languages which have only few hours of annotated in-domain training data (for example, 1 hour of data). In our approach, large amounts of annotated out-of-domain data from multiple languages are used to train multilingual MLP systems without dealing with the different phoneme sets for these languages. Features extracted from these MLP systems are used to train LVCSR systems in the low-resource language similar to the Tandem approach. In our experiments, the proposed features provide a relative improvement of about 30% in an low-resource LVCSR setting with only one hour of training data.

Keywords :

feature extraction; multilayer perceptrons; speech recognition; annotated in-domain training data; annotated out-of-domain data; feature extraction; large vocabulary continuous speech recognition; low-resource LVCSR systems; low-resource language; multilayer perceptrons; multilingual MLP features; multilingual MLP systems; multiple languages; phoneme sets; tandem approach; Abstracts; Hidden Markov models; Nonhomogeneous media; Speech; Switches; Vocabulary; MLP features for low-resource LVCSR; Multilingual training; multilayer perceptrons;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288862

Filename :

6288862

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3162834