DocumentCode :
3585082
Title :
A complete KALDI recipe for building Arabic speech recognition systems
Author :
Ali, Ahmed ; Yifan Zhang ; Cardinal, Patrick ; Dahak, Najim ; Vogel, Stephan ; Glass, James
Author_Institution :
Qatar Comput. Res. Inst., Qatar
fYear :
2014
Firstpage :
525
Lastpage :
529
Abstract :
In this paper we present a recipe and language resources for training and testing Arabic speech recognition systems using the KALDI toolkit. We built a prototype broadcast news system using 200 hours GALE data that is publicly available through LDC. We describe in detail the decisions made in building the system: using the MADA toolkit for text normalization and vowelization; why we use 36 phonemes; how we generate pronunciations; how we build the language model. We report results using state-of-the-art modeling and decoding techniques. The scripts are released through KALDI and resources are made available on QCRI´s language resources web portal. This is the first effort to share reproducible sizable training and testing results on MSA system.
Keywords :
portals; speech recognition; Arabic speech recognition systems; GALE data; LDC; MADA toolkit; MSA system; QCRI language resources Web portal; complete KALDI recipe; language model; language resources; phonemes; reproducible sizable training; text normalization; text vowelization; Acoustics; Dictionaries; Hidden Markov models; Speech; Speech recognition; Standards; Training; ASR system; Arabic; GALE; KALDI; lexicon;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
Type :
conf
DOI :
10.1109/SLT.2014.7078629
Filename :
7078629
Link To Document :
بازگشت