DocumentCode
3585082
Title
A complete KALDI recipe for building Arabic speech recognition systems
Author
Ali, Ahmed ; Yifan Zhang ; Cardinal, Patrick ; Dahak, Najim ; Vogel, Stephan ; Glass, James
Author_Institution
Qatar Comput. Res. Inst., Qatar
fYear
2014
Firstpage
525
Lastpage
529
Abstract
In this paper we present a recipe and language resources for training and testing Arabic speech recognition systems using the KALDI toolkit. We built a prototype broadcast news system using 200 hours GALE data that is publicly available through LDC. We describe in detail the decisions made in building the system: using the MADA toolkit for text normalization and vowelization; why we use 36 phonemes; how we generate pronunciations; how we build the language model. We report results using state-of-the-art modeling and decoding techniques. The scripts are released through KALDI and resources are made available on QCRI´s language resources web portal. This is the first effort to share reproducible sizable training and testing results on MSA system.
Keywords
portals; speech recognition; Arabic speech recognition systems; GALE data; LDC; MADA toolkit; MSA system; QCRI language resources Web portal; complete KALDI recipe; language model; language resources; phonemes; reproducible sizable training; text normalization; text vowelization; Acoustics; Dictionaries; Hidden Markov models; Speech; Speech recognition; Standards; Training; ASR system; Arabic; GALE; KALDI; lexicon;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop (SLT), 2014 IEEE
Type
conf
DOI
10.1109/SLT.2014.7078629
Filename
7078629
Link To Document