DocumentCode
179901
Title
Training time reduction and performance improvements from multilingual techniques on the BABEL ASR task
Author
Stuker, Sebastian ; Muller, Mathias ; Quoc Bao Nguyen ; Waibel, Alex
Author_Institution
Inst. for Anthropomatics, Karlsruhe Inst. of Technol., Karlsruhe, Germany
fYear
2014
fDate
4-9 May 2014
Firstpage
6374
Lastpage
6378
Abstract
In the IARPA sponsored program BABEL we are faced with the challenge of training automatic speech recognition systems in sparse data conditions in very little time. In this paper we show that by using multilingual bootstrapping techniques in combination with multilingual deep belief bottle neck features that are only fine tuned on the target language the training time of an LVCSR system can be essentially halved while the word error rate stays the same. We show this for recognition systems on Tagalog, making use of multilingual systems trained on the other four languages of the Babel base period: Cantonese, Pashto, Turkish, and Vietnamese.
Keywords
computational linguistics; error statistics; speech recognition; statistical analysis; training; vocabulary; BABEL ASR; Babel base period; Cantonese; IARPA; LVCSR system; Pashto; Tagalog; Turkish; Vietnamese; automatic speech recognition; multilingual bootstrapping technique; multilingual deep belief bottle neck feature; multilingual systems; performance improvement; sparse data conditions; training time reduction; word error rate; Acoustics; Context modeling; Data models; Hidden Markov models; Speech recognition; Training; Training data; automatic speech recognition; multilingual speech recognition; rapid system development;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854831
Filename
6854831
Link To Document