• DocumentCode
    3585082
  • Title

    A complete KALDI recipe for building Arabic speech recognition systems

  • Author

    Ali, Ahmed ; Yifan Zhang ; Cardinal, Patrick ; Dahak, Najim ; Vogel, Stephan ; Glass, James

  • Author_Institution
    Qatar Comput. Res. Inst., Qatar
  • fYear
    2014
  • Firstpage
    525
  • Lastpage
    529
  • Abstract
    In this paper we present a recipe and language resources for training and testing Arabic speech recognition systems using the KALDI toolkit. We built a prototype broadcast news system using 200 hours GALE data that is publicly available through LDC. We describe in detail the decisions made in building the system: using the MADA toolkit for text normalization and vowelization; why we use 36 phonemes; how we generate pronunciations; how we build the language model. We report results using state-of-the-art modeling and decoding techniques. The scripts are released through KALDI and resources are made available on QCRI´s language resources web portal. This is the first effort to share reproducible sizable training and testing results on MSA system.
  • Keywords
    portals; speech recognition; Arabic speech recognition systems; GALE data; LDC; MADA toolkit; MSA system; QCRI language resources Web portal; complete KALDI recipe; language model; language resources; phonemes; reproducible sizable training; text normalization; text vowelization; Acoustics; Dictionaries; Hidden Markov models; Speech; Speech recognition; Standards; Training; ASR system; Arabic; GALE; KALDI; lexicon;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2014 IEEE
  • Type

    conf

  • DOI
    10.1109/SLT.2014.7078629
  • Filename
    7078629