DocumentCode
179602
Title
Multilingual shifting deep bottleneck features for low-resource ASR
Author
Quoc Bao Nguyen ; Gehring, Jonas ; Muller, Mathias ; Stuker, Sebastian ; Waibel, Alex
Author_Institution
Karlsruhe Inst. of Technol., Karlsruhe, Germany
fYear
2014
fDate
4-9 May 2014
Firstpage
5607
Lastpage
5611
Abstract
In this work, we propose a deep bottleneck feature architecture that is able to leverage data from multiple languages. We also show that tonal features are helpful for non-tonal languages. Evaluations are performed on a low-resource conversational telephone speech transcription task in Bengali, while additional data for DBNF training is provided in Assamese, Pashto, Tagalog, Turkish, and Vietnamese. We obtain relative reductions of up to 17.3% and 9.4% WER over mono-lingual GMMs and DBNFs, respectively.
Keywords
Gaussian processes; feature extraction; mixture models; natural language processing; speech recognition; Assamese; Bengali; DBNF training; Pashto; Tagalog; Turkish; Vietnamese; WER; low-resource ASR; low-resource conversational telephone speech transcription task; monolingual GMM; multilingual shifting deep bottleneck features; nontonal languages; tonal features; Feature extraction; Mel frequency cepstral coefficient; Speech; Speech recognition; Standards; Training; Deep Neural Networks; Low-Resource ASR; Multilingual Deep bottleneck features;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854676
Filename
6854676
Link To Document